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Abstract 

The  purpose  of  this  study  was  to  implement  an  autonomous  face  segmentor  as 
the  front  end  to  a  face  recognition  system  on  a  Sun  SPARCStation2.  Face  recogni¬ 
tion  performance  criteria,  specifically,  the  capabilities  to  isolate  and  resize  faces  in 
an  image  to  a  consistent  scale,  were  analyzed  to  determine  current  practical  limi¬ 
tations.  Face  images  were  acquired  using  a  S-VHS  camcorder.  Segmentation  was 
accomplished  using  motion  detection  and  pre-defined  rules.  Tests  were  run  to  deter¬ 
mine  the  suitable  of  the  autonomous  segmentor  as  the  front-end  to  a  face  recognition 
system.  The  segmentation  system  developed  consistently  located  faces  and  rescaled 
those  faces  to  a  normalized  scale  for  subsequent  recognition. 


AUTONOMOUS  FACE  SEGMENTATION 


/.  Problem  Description 


1.1  Introduction 

Autonomous  face  recognition  systems  have  many  potential  applications,  such 
as  scanning  airports  for  criminals  and  verifying  users  at  bank  teller  machines.  Cur¬ 
rent  systems,  however,  are  primarily  research  tools,  ill-equipped  for  practical  appli¬ 
cations.  This  research  focuses  on  some  of  the  major  obstacles  to  practical  face  recog¬ 
nition  by  constructing  an  autonomous  face  segmentor  which  will  operate  as  the  front 
end  to  a  complete,  autonomous  face  recognition  system  residing  on  a  Sun  SPARC- 
Station2.  Complementary  theses  by  Krepp  and  Runyon  will  focus  on  the  remaining 
portions  of  this  complete  face  recognition  system  (14)(22).  The  face  recognition 
system  is  intended  for  use  in  a  realistic  office  environment. 

This  problem  description  begins  with  a  background  review  of  the  most  relevant 
face  recognition  research,  followed  by  the  problem  statement.  Next,  the  research 
objectives,  assumptions,  scope,  and  standards  are  stated.  The  approach  is  then 
presented,  and  the  chapter  concludes  with  an  overview  of  the  remaining  chapters. 

1.2  Background. 

1.2.1  Suarez  and  Goble.  Suarez  and  Goble  researched  face  recognition  at 
the  Air  Force  Institute  of  Technology  using  the  Karhunen  Loeve  and  Discrete  Co¬ 
sine  transforms,  respectively  (24)(7).  Their  recognition  accuracies  were  respectable; 
however,  face  segmentation  was  done  manually,  and  recognition  software  resided  in 
pieces  on  several  machines. 


1 


A  face  recognition  system  being  developed  by  Runyon  is  based  on  the  tech¬ 
niques  developed  by  Suarez’s  research.  This  face  segmentation  research  will  be  the 
front  end  for  Runyon’s  recognition  system. 

1.2.2  Turk  and  Pentland.  Turk  and  Pentland,  from  the  Massachusetts  In¬ 
stitute  of  Technology,  constructed  a  completely  autonomous  face  recognition  system 
(25).  Like  Suarez,  they  used  Karhunen  Loeve  transforms  for  the  recognition  process. 
Their  system  used  three  processors;  the  first  two  dedicated  to  segmentation,  and 
the  third,  recognition.  Their  segmentation  technique  uses  motion  detection,  motion 
analysis,  and  the  error  in  reconstructed  images.  Turk  and  Pentland  did  not  address 
the  limitations  of  their  system  in  a  practical  environment  although  the  amount  of 
processing  power  required  is  an  obvious  drawback. 

The  concepts  of  using  motion  to  find  faces  is  directly  applicable  to  this  research, 
but  the  processing  power  will  be  significantly  different. 

1.2.3  Lambert’s  AFRM.  The  AFRM  (Autonomous  Face  Recognition  Ma¬ 
chine)  was  developed  at  the  Air  Force  Institute  of  Technology  before  Lambert’s 
research,  but  it  was  his  enhancements  which  make  the  execution  time  and  segmenta¬ 
tion  results  respectable.  Lambert’s  segmentation  technique  used  motion  detection, 
brightness  normalization,  and  “face”  brightness  patterns.  The  system  as  a  whole  did 
not  meet  expectations,  and  it  was  clearly  not  useful  in  a  practical  environment. 

The  motion  detection  and  brightness  normalization  are  the  concepts  most  ap¬ 
plicable  to  current  research.  His  recognition  technique,  which  is  based  on  cortical 
thought  theory,  takes  more  time  and  is  less  accurate  than  current  techniques. 

1.3  Problem  Statement 

Current  face  recognition  systems  are  not  yet  practical  for  commercial  use.  The 
purpose  of  this  study  is  to  investigate  limitations  of  autonomous  face  segmentation 


which  is  part  of  a  complete,  autonomous  face  recognition  system  operating  in  a 
realistic  environment. 

1.4  Research  Objectives 

The  objective  of  this  research  is  to  uncover,  and  overcome,  the  major  obstacles 
to  autonomous  face  segmentation  for  a  single  workstation  autonomous  face  recogni¬ 
tion  system. 

1.5  Assumptions 

The  face  segmentation  assumes: 

1.  A  fixed-position  imaging  device. 

2.  Fixed  focal  length. 

3.  Automatic  intensity  compensation. 

4.  Continuous  head  movement. 

5.  No  movement  directly  behind  the  system  user  which  is  shoulder  height  or 
higher. 

6.  Consistent  office  lighting. 

1.6  Scope 

The  scope  of  this  thesis  is  to  investigate  the  limitations  of  an  autonomous  face 
segmentation  pvstem  operating  on  a  Sun  SPARCstation2. 

1.7  Standards 

The  performance  criterion  for  this  face  segmentation  is  a  suitable  face  for  face 
recognition.  The  key  criterion  for  the  face  recognition  system  is  classification  ac¬ 
curacy,  as  well  as  user  interaction  and/or  constraints.  The  key  constraints  for  face 
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segmentation  with  regard  to  classificaiion  accuracy  in  this  face  recognition  system 
are  : 

1.  Minimal  background  in  each  image,  and 

2.  Consistent  scale  across  all  images. 

The  key  user  constraints  to  minimize  for  face  segmentation: 

1.  Tailored  backgrounds,  and 

2.  User  actions. 

1.8  Approach/Methodology 

A  Super  VHS  camcorder  connected  to  a  Sun  SPARCStation2  with  a  VideoPix 
frame  grabber  card  provides  the  input  imaging  capability  of  the  system.  A  software 
environment  developed  and  executing  on  the  Sun  SPARCstation2  performs  the  face 
segmentation  and  recognition.  This  software  combines  commercial  software  with 
new  software  written  in  “C.” 

1.9  Overview 

Chapter  Two  presents  a  review  of  current  literature  related  to  face  recognition 
systems.  Chapter  Three  provides  a  detailed  description  of  the  methodology  used  in 
this  thesis,  and  Chapter  Four  provides  test  results.  Chapter  Five  presents  conclusions 
based  on  the  test  results  and  makes  recommendations  for  future  study. 
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11.  Literature  Review 


2.1  Introduction 

When  people  look  at  photographs  of  friends,  or  run  into  a  co-worker  at  the 
supermarket,  or  talk  with  family  members  at  the  dinner  table,  they  probably  never 
stop  to  consider  how  they  recognize  these  people.  Recognizing  people  is  so  com¬ 
monplace  to  humans  they  usually  don’t  think  about  how  remarkable  this  process 
is.  Faces  are  one  of  the  most  common  discriminators  humans  use  to  recognize  peo¬ 
ple,  and  humans  are  very  good  at  recognizing  faces  (experiments  have  shown  that 
monkeys,  sheep,  and  even  pigeons  are  also  very  good  at  recognizing  human  faces) 
(19)(12)(21:35).  Yet,  as  seemingly  effortless  as  this  process  is  for  humans  to  perform, 
no  one  has  been  able  to  build  a  machine  that  performs  anywhere  near  as  well  as  a 
pigeon,  let  alone  a  human.  In  spite  of  this  fact,  there  are  some  practical  applications 
which  would  benefit  from  a  high  confidence  face  recognition  system.  Scanning  for 
criminals  in  airports  or  verifying  account  owners  at  a  bank  teller  machine  are  just 
two  of  many  possible  examples.  One  of  the  requirements  for  this  thesis  is  to  build 
an  autonomous  near-real  time  face  recognition  system.  Given  the  performance  dis¬ 
parity  between  humans  and  machines,  it  seems  prudent  to  review  the  latest  research 
regarding  biological  face  recognition,  as  well  as  the  latest  face  recognition  technology. 
The  results  of  this  literature  investigation  are  provided  in  the  following  section.  The 
final  section  of  this  chapter  will  briefly  summarize  the  current  status  of  face  recogni¬ 
tion  technology  as  found  in  the  literature  search,  and  state  what  can  be  concluded 
regarding  research  objectives. 

2.2  Face  Recognition  Research 

One  of  the  most  interesting  facts  about  investigating  how  the  brain  “does”  face 
recognition  is  that  no  one  knows  how  the  brain  “does”  any  process,  face  recognition 
included.  There  are,  however,  many  hypotheses;  some  of  which  enjoy  greater  accep>- 
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tance  than  others.  The  latest  trends  in  face  lecognition  research  indicate  that  there 
are  at  least  two  major  subprocesses  in  recognizing  faces.  First,  taking  in  a  visual 
scene  and  realizing  there  is  a  face  in  that  scene  at  all.  This  detection  of  a  face  is  not 
recognition  of  a  specific  person’s  face,  it  is  simply  realizing  that  a  face  exists  in  the 
scene  (for  example,  as  someone  looks  around,  he  realizes  there  is  a  face  looking  at 
him  from  the  bushes).  This  process  is  referred  to  as  segmentation.  The  second  major 
subprocess  is  recognizing  that  a  face  “belongs  to”  a  specific  person  (returning  to  the 
example,  upon  further  study,  he  realizes  it  is  that  weird  kid  down  the  street  that  is 
looking  at  him  from  the  bushes).  This  second  process  is  usually  referred  to  as  face 
recognition  by  itself,  but  it  will  be  referred  to  herein  as  individual  face  recognition 
to  distinguish  it  from  the  overall  process. 

2.2.1  Face  Segmentation  When  humans  look  at  a  photograph  of  someone, 
they  can  immediately  determine  if  there  is  a  face  in  that  picture,  but  how  do  they 
know  this?  Some  research  suggests  humans  have  cells  which  respond  uniquely  to 
faces,  regardless  of  whose  face  it  is  or  how  it  is  represented  (e.g.,  photograph,  line 
drawing,  or  some  other  representation)  (20)(19)(9).  Face  sensitive  cells  suggest  that 
face  segmentation  is  a  biological  process.  Support  for  this  hypothesis  can  be  found  in 
the  research  conducted  on  prosopagnosia  patients.  These  people  exhibit  no  deficit 
in  the  ability  to  find  and  discriminate  faces,  but  they  cannot  identify  the  person  to 
whom  the  face  belongs  (4)(3)(18). 

Other  research  suggests  that  face  sensitive  cells  are  excited  by  particular  com¬ 
binations  of  cells  (i.e.,  patterns)  on  the  visual  cortex.  These  cells  on  the  visual  cortex 
respond  as  if  a  localized  frequency  analysis  has  been  applied  to  the  visual  field  (10). 
These  experiments  have  spawned  research  with  wavelet  segmentation;  wavelets  per¬ 
form  a  localized  frequency  analysis.  Wavelet  segmentation  results  with  tanks  and 
such  are  encouraging  (23),  but  very  little  face  segmentation  with  v;avelets  has  been 
attempted  (26). 
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While  the  biological  research  on  face  segmentation  progresses,  other  researchers 
have  designed  and  built  systems  to  segment  the  faces  in  an  image.  The  success 
rate  of  these  systems  has  been  modest,  but  may  be  sufficient  for  some  applications. 
These  systems  find  faces  by  using  expected  brightness  patterns  to  find  areas  in  the 
image  which  may  be  the  eyes  (1)(15)(16)  or  face  outline  (27)(8)(16).  To  reduce  the 
computational  load,  most  systems  use  pre-defined  rules,  such  as  head  size  (25)(8)  or 
motion  (25)  to  reduce  the  possibility  of  falsely  identifying  a  non-face  object  as  a  face. 
To  speed  up  the  segmentation  process,  some  face  recognition  systems  use  motion  to 
limit  the  search  area  (25)(15).  The  underlying  assumption  of  this  technique  is  that 
people  almost  never  keep  their  heads  perfectly  still.  Experimental  results  have  shown 
this  to  be  a  reasonable  assumption.  Using  motion  is  reasonable  from  a  biological 
standpoint,  as  well,  since  research  has  shown  that  (a)  some  animals  use  motion  to 
detect  objects  and  (b)  we  have  cells  which  respond  solely  to  motion  (11).  Even  with 
motion  and  pre-defined  rules,  segmentation  is  usually  the  most  time  consuming  and 
computationally  intensive  portion  of  any  autonomous  face  recognition  system. 

2.2.2  Individual  Face  Recognition  Once  the  face  is  found  in  the  image, 
the  face  recognition  process  must  be  completed  by  identifying  the  individual  to  whom 
the  face  belongs  (individual  face  recognition).  This  is  true  regardless  of  whether 
we  are  working  with  a  biological  or  man-made  process.  The  two  most  common 
approaches  to  individual  face  recognition  are  feature-based  and  holistic.  Biological 
experiments  have  been  cited  to  support  feature-based  and  holistic  recognition  (2), 
although  neither  approach  can  claim  superiority  based  on  the  experimental  results. 
Recent  studies  suggest  humans  may  do  both  feature-based  and  holistic  recognition 
in  parallel  (5:6). 

The  feature-based  recognition,  as  might  be  guessed  from  the  name,  uses  mea¬ 
surements  based  on  the  features  of  a  face  (e.g.,  eyes,  nose,  and  mouth)  to  characterize 
a  person’s  face.  Which  features  are  the  right  ones  to  use,  or  which  features  does  the 
brain  use?  We  don’t  know.  Researchers  have  tried  a  variety  of  measurements  with 
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modest  results.  Feature-based  techniques  were  used  in  the  early  seventies  when  face 
recognition  was  first  attempted  by  computers,  and  feature-based  research  still  con¬ 
tinues  today.  One  of  the  most  significant  limitations  of  the  feature-based  approach  is 
the  autonomous  segmentation  of  the  features.  One  recently  developed  autonomous 
feature-based  face  recognition  system  required  forty  minutes  on  a  VAX  8530  just  to 
find  the  face  and  get  it  into  a  format  the  system  could  use  to  extract  the  feature 
measurements  (16). 

The  holistic  approach  uses  the  “entire  face”,  not  just  selected  features,  as  an 
input  to  the  individual  recognition  process.  The  holistic  research  has  been  based  pri¬ 
marily  on  data  compression  techniques  which  have  the  advantage  of  representing  the 
entire  image  with  just  a  few  coefficients  (6)(13)(7).  In  fact,  face  recognition  systems 
have  been  built  which  use  one  of  three  following  data  compression  techniques; 

•  The  Karhunen  Loeve  Transform  (25)(13)(24) 

•  The  Discrete  Cosine  Transform  (7) 

•  A  Neural  Network  with  Unsupervised  Feature  Extraction  (6) 

There  is  no  clear  advantage  to  using  any  one  of  these  three  techniques  based  on 
recognition  success  rates;  recognition  success  rates  were  essentially  equivalent  for 
systems  using  any  one  of  these  techniques.  An  additional,  and  perhaps  more  sig¬ 
nificant,  advantage  of  holistic  recognition  is  that  rigorous  segmentation  may  not  be 
required;  the  location  of  the  face  is  needed,  but  not  individual  features.  Face  recog¬ 
nition  research  using  the  holistic  approach  has  been  around  for  only  a  few  years,  but 
results  have  been  encouraging. 

2.3  Summary  and  Conclusions 

Current  autonomous  segmentation  techniques  have  had  only  limited  success 
in  locating  faces  in  an  image.  To  improve  their  success  rate,  most  systems  make  a 
number  of  assumptions  about  the  objects  that  will  appear  in  the  visual  field.  These 
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assumptions  severely  limit  the  usefulness  of  these  systems  in  practical  applications. 
A  secondary  issue  is  the  computational  burden  and  execution  time.  Although  motion 
and  pre-defined  rules  may  reduce  the  execution  and  the  computational  burden,  they 
intrinsicly  make  further  assumptions  about  the  system’s  environment  and,  therefore, 
further  limit  their  portability  to  new  environments.  Face  segmentation  with  wavelets 
may  benefit  the  face  segmentation  field,  but  that  research  is  still  open. 

The  individual  face  recognition  portion  of  the  face  recognition  process  can  be 
done  by  current  equipment  with  reasonable  accuracies  using  either  the  feature-based 
or  holistic  approach.  A  major  drawback  to  feature-based  recognition  is  the  rigorous 
segmentation  feature-based  systems  require;  as  discussed  above,  current  segmenta¬ 
tion  techniques  are  limited.  Holistic  recognition  is  promising,  partly  because  it  does 
not  appear  to  require  rigorous  segmentation  to  achieve  reasonable  results. 

As  mentioned  earlier,  one  of  the  requirements  for  this  thesis  is  to  build  a  portion 
of  an  autonomous  near-real  time  face  recognition  system,  and  sponsor  requirements 
dictate  this  system  will  be  integrated  into  a  Sun  SPARCstation2.  Given  the  results 
of  the  literature  search,  a  motion-aided  holistic  face  recognition  system  seems  to  be 
the  most  reasonable  solution  given  the  technology  available  today.  The  assumptions 
about  motion  are  acceptable  for  the  intended  environment  of  this  system,  and  if 
better  segmentation  techniques  become  available  they  can  be  easily  integrated  into 
this  type  of  system. 
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III.  Methodology 


3.1  General 

The  main  objective  of  this  thesis  was  to  implement  segmentation  techniques  on 
a  Sun  workstation  to  determine  the  capabilities  of  these  techniques  to  segment  a  face 
from  an  input  scene  which  is  suitable  for  use  as  an  input  to  a  face  recognition  system. 
The  approach  of  this  research  was  to  use  an  algorithm  to  analyze  the  movement 
between  two  successive  images  to  find  possible  face  regions.  This  approach  is  based 
on  the  fact  that  humans  do  not  keep  their  heads  perfectly  still;  a  certain  amount  of 
motion  is  always  present  (17).  A  side  benefit  to  this  approach  is  that  if  this  technique 
cannot  segment  suitable  faces  by  itself,  it  can  be  used  by  other  techniques  to  find 
areas  of  interest. 

3.2  Motion  Analysis 

3.2.1  Capturing  Images  The  first  step  was  to  develop  the  capability  to 
capture  images  and  bring  them  into  the  Sun  for  analysis.  The  tools  used  were  the 
Sun  VideoPix  and  a  S-VHS  camcorder.  The  reasons  for  choosing  the  VideoPix  were 
as  follows; 

1.  The  tool  capability  seemed  adequate  for  our  application.  The  specification 
stated  it  could  capture  up  to  four  grey-scale  images  per  second;  color  images 
at  up  to  one  per  second. 

2.  The  tool  can  be  controlled  via  “C”  routines  which  come  with  the  tool. 

3.  The  hardware  and  software  were  available  and  already  integrated  into  a  Sun 
workstation. 

ThejS-VHS  camcorder  was  chosen  because  it  provided  automatic  brightness  con¬ 
trol,  and  could  supply  color  or  grey-scale,  live  or  video-taped  images.  Appendix  A 
contains  information  on  using  VideoPix. 
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The  images  provided  by  the  VideoPix  tool  are  720  by  480  non-square  pixel 
YUV  images,  where  Y  is  luminance  and  UV  are  chrominance^  The  conversion  to 
NTSC  format  square  pixel  data  produces  a  640  by  480  image.  The  decision  was 
made  to  use  8- bit  grey  scale  images  because  these  types  of  images  had  been  used 
successfully  for  both  motion  and  recognition  in  the  past  (25)(15). 

3.2.2  Finding  Motion  Regions  After  developing  the  capability  to  capture 
images  on  the  Sun  workstation,  the  next  step  was  to  find  the  regions  of  movement. 
The  technique  for  finding  moving  regions,  and  one  that  had  been  used  successfully  in 
previously  research  (25)(15),  was  frame-to-frame  subtraction.  The  reasoning  behind 
this  approach  is,  given  that  the  camera  and  focal  region  remain  fixed,  the  pixel  values 
that  change  between  two  images  are  considered  regions  of  movement. 

One  obvious  consideration  is  the  time  lag  that  occurs  between  capturing  the 
first  and  second  images.  If  this  time  lag  is  too  large,  the  resolution  of  the  motion 
regions  deteriorates.  This  time  lag  was  measured  using  the  Unix  date  command. 
Additionally,  the  motion  region  created  when  a  subject  greatly  exaggerates  his  mo¬ 
tion  was  determined.  Although  the  acquisition  rate  for  the  VideoPix  tool  i?  fairly 
slow,  it  is  fast  enough  that  a  subject  would  have  to  intentionally  exaggerate  his  mo¬ 
tion  to  cause  any  significant  deterioration  of  the  motion  image,  and  in  this  research 
a  cooperative  user  was  assumed. 

Noise  was  filtered  out  with  a  median  filter;  a  median  filter  assigns  pixel  values 
based  on  the  median  value  of  the  pixels  surrounding  a  pixel  as  well  as  the  pixel  value 
itself. 

As  can  be  seen  in  Figure  1,  the  “motion  regions”  that  result  from  this  approach 
are  not  usually  solid  regions,  but  rather  loose  outlines  of  moving  objects,  with  gaps 
in  the  outline,  and  random  patterns  of  pixels  internal  to  the  object  “outline”  which 

'  More  detail  on  the  YUV  data  is  given  in  Appendix  A 
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depend  on  both  the  pattern  of  the  moving  object  and  the  motion  internal  to  that 
object  (i.e.,  blinking  eyes  or  moving  mouth). 


Figure  1.  Typical  Motion  Image 


To  form  a  defined  region,  the  “motion”  pixel  closest  to  the  top  was  found  for 
each  column  of  the  image  and  all  pixels  below  that  pixel  were  considered  part  of  the 
motion  region.  An  example  of  this  modified  image  can  be  seen  in  Figure  2. 

The  decision  to  find  the  pixel  closest  to  the  top  was  based  on: 

1.  Many  times  the  object  “outlines”  were  so  poor  that  no  standard  algorithm 
could  be  used  to  simply  “close  the  gaps.” 

2.  Given  the  target  environment  for  this  system  (office  environment),  it  was  as¬ 
sumed  that  if  a  person  was  in  the  scene  there  would  not  be  motion  above  that 
person’s  head.  It  was  recognized  that  a  second  person  standing  directly  behind 
a  person  sitting  in  a  chair,  would  obscure  the  sitting  person;  however,  that  was 
felt  to  be  an  acceptable  limitation. 

3.  The  top  portion  of  the  motion  outlines  is  the  most  consistent  outline  edge. 
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Figure  2.  Motion  Outline  I 


To  close  small  gaps  that  still  exist  in  the  modified  motion  image,  the  columns  of  this 
image  were  then  grouped,  and  all  columns  in  a  particular  group  were  assigned  the 
“highest”  motion  pixel  value  for  any  column  within  that  group.  An  example  of  this 
image  is  shown  in  Figure  3. 

As  is  evident  from  Figure  3,  random  noise  or  large  gaps  in  the  motion  outline 
occasionally  caused  “spikes”  to  occur  in  the  motion  region.  Therefore,  single  column- 
group  spikes  were  eliminated  by  looking  for  large  changes  in  motion  pixel  location 
from  one  column  to  the  next  followed  by  another  large  change  in  motion  pixel  location 
in  the  opposite  direction.  An  example  of  the  resulting  “cleaned  up”  motion  image  is 
provided  in  Figure  4. 

Alternately,  motion  regions  formed  by  subtracting  two  color  (rgb)  images  were 
tested  to  determine  if  there  was  any  advantages  to  using  color  images  instead  of  grey 
scale  to  determine  the  motion  regions. 

3.2.3  Segmentation  by  Region  Analysis  Once  motion  regions  were  de¬ 
fined,  these  regions  were  then  analyzed  to  find  features  which  were  common  to  all 
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Figure  3.  Motion  Outline  II 


Figure  4.  Final  Motion  Outline 


face  regions.  A  secondary  analysis  was  to  find  features  which  were  unique  only  to 
face  regions,  but  this  was  only  secondary  because  it  was  felt  that  later  processing 
could  eliminate  non-faces  as  long  as  the  segmentation  was  providing  suitable  face 
regions  when  they  were  part  of  the  input  scene. 

Because  (a)  motion  regions  were  created  by  “blanketing”  objects  from  the  top, 
and  (b)  human  heads  are  tall  oblong  shapes  on  top  of  a  wider  shape  (shoulders  and 
body),  the  difference  between  the  height  of  the  shoulders  and  the  top  of  the  head  is 
manifested  as  a  dramatic  change,  or  large  slope,  in  the  motion  region.  The  analysis, 
therefore,  simply  looks  for  a  large  positive  slope,  a  fairly  stable  region,  and  then  a 
large  negative  slope.  The  entire  image  is  searched  for  regions  which  fit  this  profile 
over  several  slope  thresholds.  Multiple  slope  thresholds  are  an  attempt  to  overcome 
the  variations  in  subjects  and  the  “quality”  of  the  motion  outline. 

Once  the  portions  of  the  motion  region  had  been  found  which  fit  the  “head” 
profile,  these  regions  were  “cut”  out  of  the  image  and  rescaled  to  a  standard  size.  It 
was  hoped  that  since  the  region  segmented  out  of  the  original  image  was  dependent 
on  the  width  of  the  moving  head,  that  rescaling  the  segmented  region  to  a  standard 
size  would  provide  sufficient  scale  invariance  for  an  automatic  recognition  system. 
The  scale  invariance  was  tested  by  segmenting  the  face  of  a  subject  at  three  different 
camera  “zoom”  settings.  These  camera  settings  significantly  altered  the  scale  of 
the  subject’s  head  in  the  image  input.  The  segmented  faces  were  then  visually 
inspected  to  determine  how  well  the  segmentation  algorithm  “normalized”  the  scale 
given  the  three  scale  variations  in  the  input  images.  Faces  segmented  during  the 
previous  segmentation  tests  were  also  visually  inspected  to  further  determine  the 
scale  consistency  of  faces  segmented  by  this  technique.  The  actual  “sufficiency”  of 
this  scale  normalization  for  a  recognition  system  may  be  tested  in  a  complementary 
thesis  by  Ken  Runyon  (22). 
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3.3  Binarized  Face  Patterns 

To  provide  an  alternate  method  of  finding  the  faces  in  an  image,  it  seemed 
reasonable  to  search  for  specific  features  of  faces  in  the  image.  One  technique  used 
previously  at  AFIT  which  appeared  to  be  fairly  successful  was  to  “lambertize”  an 
image  and  then  search  for  features  such  as  eyes,  nose,  and  mouth  (15).  To  over¬ 
come  feature  obscuring  due  to  brightness  variations  across  a  scene,  Lambert  found 
the  brightness  variations  within  some  window  of  the  visual  scene.  The  variations 
remained  fairly  consistent  even  when  the  overall  brightness  of  the  image  change  rad¬ 
ically.  Thus,  by  finding  these  local  variations,  it  was  hoped  that  facial  features  would 
appear  consistently  in  a  “lambertized”  image. 

Local  variations  in  an  image  were  found  using  Lambert’s  technique  and  several 
different  window  sizes.  The  input  image  was  then  reduced  and  then  local  variations 
were  found  in  the  reduced  image  as  well.  Algorithms  for  finding  faces,  based  on  these 
images,  may  be  developed  in  future  research. 

3.4  Summary 

The  face  segmentation  approach  taken  in  this  research  can  be  summarized  as; 

1.  Capture  two  images  from  a  fixed  camera  in  rapid  succession. 

2.  Perform  frame-to-frame  subtraction  to  determine  movement. 

3.  Eliminate  noise  from  the  motion  image. 

4.  Group  pixels  to  form  a  more  defined  motion  region. 

5.  Analyze  the  motion  region  to  find  face  regions. 

6.  Cut  these  face  regions  out  of  the  input  image. 

7.  Resize  these  face  regions  to  create  a  standard  size  vector  (these  vectors  become 

the  input  to  a  face  recognition  system). 
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The  details  of  this  algorithm  and  the  tests  conducted  to  investigate  this  ailgorithm 
have  been  described  in  this  chapter.  The  results  of  these  tests  are  documented  in 
the  chapter  that  follows. 
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IV.  Results 


4-1  General 

4.2  Capturing  Images 

The  VideoPix  hardware  and  software  performed  as  expected.  An  example  of 
two  8-bit  grey  scale  images  captured  in  a  software  loop  are  provided  in  Figure  5. 
The  time  lag  between  two  successive  captures  was  measured  at  0.5  seconds.  This 


IMAGE  1  IMAGE  2 

Figure  5.  8-Bit  Grey  Scale  Images 


time  lag  was  determined  by  executing  the  Unix  date  command,  capturing  twenty 
images  in  a  software  loop,  executing  date  again,  and  then  dividing  the  difference 
by  twenty.  This  difference  was  taken  ten  diflferent  times  and  each  time  the  same 
measurement  was  found — footnotelt  was  believed  that  capturing  twenty  images  and 
dividing  would  produce  a  better  estimate  of  the  time  between  two  successive  grabs 
because  the  date  command  gives  time  to  the  nearest  second,  and  the  measurement 
was  expected  to  be  in  fractions  of  seconds.. 

4.3  Finding  Motion  Regions 

As  mentioned  in  the  previous  section,  the  frame  by  frame  subtraction  using 
8-bit  grey  scale  images  did  not  provide  the  consistent  motion  expected.  The  funda- 
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mental  assumption  was  that  biological  subjects  “wobble” ,  and  thus  a  clearly  defined 
motion  outline  was  expected  even  when  subjects  were  trying  to  remain  as  still  as 
possible  (17).  Figure  6  clearly  shows  that  this  was  not  the  case.  Multiple  attempts  at 
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IMAGE  1  IMAGE  2 

Figure  6.  Bad  Motion  Image 

forming  this  motion  image  revealed  that  if  the  subject  simply  sat  as  if  he  or  she  were 
working  at  the  computer  terminal,  the  motion  regions  formed  were  too  inconsistent 
to  use  for  segmentation.  This  was  particularly  unfortunate  in  light  of  the  fact  that 
this  was  the  projected  user  scenario  for  the  final  face  recognition  system. 

On  the  opposite  end  of  the  spectrum,  a  motion  image  was  formed  with  the 
subject  greatly  exaggerating  his  motion.  As  can  be  seen  in  Figure  7,  severe  motion 
will  also  produce  a  potentially  unusable  motion  image.  This  type  of  motion  is  essen- 
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IMAGE  1  IMAGE  2  "MOTION" 

Figure  7.  Severe  Motion  Image 
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tially  ignored  in  this  research,  since  a  cooperative  user  was  assumed.  However,  this 
test  revealed  limitation  of  the  system  caused  by  the  relatively  slow  acquisition  rate 
of  the  VideoPix  tool. 

The  noise  in  the  motion  image  was  able  to  be  filtered  out  very  eflfectively  by 
a  median  filter.  Figure  8  shows  this  noise  elimination  by  providing  an  example  of 
a  noisy  motion  image  and  the  resulting  images  after  passing  through  the  median 
filter.  Typically,  a  difference  threshold  of  ten  was  set  for  “motion”  pixels  from  the 


MOTION 


1ST  PASS 


2ND  PASS 


Figure  8.  Median  Filtering  an  Image 


frame  to  frame  subtraction,  and  the  image  was  passed  twice  through  a  median  filter. 
Through  experimentation,  this  difference  threshold  and  two  passes  through  the  filter 
were  found  to  remove  the  noise  in  the  image  consistently. 

To  overcome  this  motion  “inconsistency”,  the  decision  was  made  to  thresh¬ 
old  the  number  of  pixels  “turned-on”  in  the  motion  image.  The  drawbacks  of  this 
approach  are: 

1 .  The  higher  the  threshold  the  more  dramatic  the  motion  must  be  on  the  part 
of  the  user  and  usually  the  longer  that  user  must  wait  for  segmentation. 

2.  The  more  dramatic  the  motion  the  less  precise  the  segmentation  and,  therefore, 
the  less  consistent  the  scale  in  the  segmented  images. 

3.  The  threshold  does  not  guarantee  a  good  motion  outline,  it  simply  indicates  a 
greater  likelihood  of  a  good  outline  given  stable  conditions. 
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Experimentally  trading-off  the  quality  of  the  motion  image  with  the  user  movement 
requirements  and  wait  time,  a  motion  threshold  of  3000  pixels  was  found  to  be  the 
best  motion  threshold.  Of  course  this  threshold  will  vary  with  the  number  of  pixels 
in  an  image  and  the  size  of  the  moving  objects  in  the  image  as  well. 

As  mentioned  in  the  previous  chapter,  moving  objects  found  using  frame-to- 
frame  subtraction  are  not  usually  solid,  but  somewhat  abstract  outlines  and  random 
internal  patterns.  The  process  to  form  a  defined  motion  outline  is  shown  visually  in 
Figure  9  and  briefly  summarized  as: 

1 .  The  noise  in  the  motion  image  was  removed  by  median  filtering. 

2.  The  “motion”  pixel  closest  to  the  top  was  found  for  each  column  of  the  image 
and  all  pixels  below  that  pixel  were  considered  part  of  the  motion  region. 

3.  The  columns  of  this  motion  outline  were  then  grouped  to  close  these  gaps. 

4.  Finally,  single  column  “spikes”  were  eliminated  by  looking  for  large  slopes  in 
one  direction  followed  by  a  large  slope  in  the  opposing  direction. 

Forming  motion  regions  by  subtracting  two  color  images  was  attempted  to 
determine  if  there  were  any  improvement  over  using  grey  scale  images.  However,  the 
motion  region  created  from  two  color  images  did  not  improve  the  consistency  of  the 
motion  outline  at  all,  and  the  noise  in  the  color  image  was  not  filtered  out  effectively. 
Figure  10  is  an  example  of  the  motion  image  created  using  color  images. 

An  alternate  method  of  finding  the  outline  of  moving  objects  could  be  devel¬ 
oped  using  an  edge  enhancement  algorithm.  The  edges  of  the  objects  in  the  input 
image  are  found  using  an  edge  enhancement  algorithm,  and  the  motion  analysis  could 
be  used  to  determine  the  edges  of  the  moving  objects.  Finding  the  exact  edges  of  the 
head  would  allow  easy  elimination  of  all  background  and  also  produce  a  rigid  scale 
standard  for  resized  head  regions.  An  example  of  an  edge  enhanced  input  image  is 
provided  in  Figure  11. 
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Figure  9.  Forming  a  Motion  Region  Image 
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MOTION  FROM  COLOR 


FILTERED  MOTION 


Figure  10.  Motion  Image  From  RGB  Images 


INPUT  IMAGE  EDGE  IMAGE 


Figure  11.  Sample  Edge  Image 


Segmentation  Algorithm  Test 

Possible  face  regions  are  segmented  according  to  the  algorithm  described  in 
Chapter  three.  This  segmentation  algorithm  searches  the  motion  image  for  large 
positive  slope-level  region-large  negative  slope  regions.  The  image  is  searched  using 
several  slope  thresholds  to  account  for  poor  motion  images  and  variations  between 
subjects.  The  top  of  a  motion  outline  of  a  face  was  rarely  flat  with  a  smooth  curve 
at  the  sides  of  the  head.  Many  times  there  were  small  positive  and  negative  slopes 
along  the  top  of  the  outline  and  very  often  small  plateaus  at  the  side  of  the  head.  By 
ignoring  small  slopes  that  did  not  exceed  some  moderate  threshold,  these  small  de¬ 
viations  could  easily  be  overcome.  Some  hairstyles  however  caused  a  more  moderate 
slope  at  the  sides  of  the  head,  forcing  lower  thresholds  to  detect  the  sides  of  the  head. 
By  searching  the  motion  outline  three  times  using  three  different  slope  thresholds, 
it  was  experimentally  found  that  all  these  irregularities  could  be  overcome. 

A  visual  representation  of  the  entire  segmentation  process  is  provided  in  Fig¬ 
ure  12. 

The  first  part  of  the  segmentation  test  was  performed  by  videotaping  several 
subjects  and  allowing  the  segmentation  routine  to  segment  all  regions  that  might  be 
valid  faces.  The  segmentation  test  was  run  three  times  on  the  videotaped  subjects 
with  three  different  thresholds  on  the  number  of  motion  pixels.  Figure  13  shows  a 
sample  8-bit  grey  scale  input  image,  the  motion  image  associated  with  that  image, 
the  segmented  regions  in  that  image,  and  the  standard  size  image  (32  x  32)  for  each 
segmented  region.  A  complete  set  of  these  segmentation  test  images  are  provided 
in  appendix  C.  For  scale  comparison,  all  of  the  segmented  regions  for  the  highest 
motion  threshold  test  are  provided  in  Figure  14. 

The  segmentation  test  results  showed  (a)  each  “successful”  capture  consistently 
had  at  least  one  “properly”  segmented  face,  and  (b)  “properly”  segmented  faces 
were  not  one,  but  a  few,  consistent  scales.  A  “successful”  capture  is  one  in  which 
the  number  of  pixels  changed  in  the  motion  image  exceeded  a  pre-defined  threshold. 
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IMAGE  1  IMAGE  2  "MOTION" 
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Figure  12.  The  Segmentation  Process 
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Figure  13.  Sample  Segmentation  Test  Images 


“Properly”  segmented  means  the  regions  segmented  from  the  original  image  appear 
suitable  for  input  into  a  face  recognition  system.  The  lowest  motion  threshold  setting 
(1500)  had  a  few  badly  segmented  face  regions  due  to  the  poor  motion  regions  on 
the  front  end  of  the  segmentation  algorithm.  The  higher  threshold  settings  (3000 
and  4500)  consistently  segmented  suitable  faces  for  recognition  provided  that  one 
existed  in  the  input  scene. 

The  next  part  of  this  test  investigated  the  scale  “normalization”  capability  of 
this  algorithm.  The  test  was  conducted  by  placing  a  single  subject  in  the  system 
field  of  view  at  three  very  different  “zoom”  settings  on  the  camera.  The  three 
“zoom”  settings  made  the  subject’s  face  three  very  different  sizes  in  the  input  image. 
Three  faces  were  autonomously  segmented  at  each  of  the  three  “zoom”  settings, 
resulting  in  a  total  of  nine  faces  segmented  for  that  subject.  The  results  of  this 
test,  shown  in  Figure  15,  and  the  results  of  the  previous  segmentation  tests,  indicate 
the  segmentation  and  rescaling  is  somewhat  tolerant  of  changes  in  the  input  scale, 
although  more  prototypes  of  each  subject  may  need  to  be  taken  to  account  for  the 
slight  variations  in  scale  that  did  occur. 

Face  orientation  may  alter  scale  depending  on  the  facial  view  segmented,  but 
each  view  will  be  a  consistent  scale.  Thus,  if  a  recognition  system  should  attempt  to 
handle  multiple  orientations,  prototypes  taken  from  different  viewpoints  using  this 
algorithm  should  still  be  suitable. 
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Figure  14.  Sample  Set  of  Segmented  Regions 
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Figure  15.  Segmentation  Scale  Test 
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Faces  segmented  using  this  algorithm  may  be  further  tested  for  their  suitability 
an  input  to  a  face  recognizer  in  a  complementary  thesis  by  Runyon  (22). 

^.5  Binarized  Face  Patterns 

Figure  16  shows  the  8-bit  grey  scale  image  used  as  the  input  for  the  “lamber- 
tized”  images  and  the  variation  images  which  resulted  from  lambertizing  the  image 
using  the  local  brightness  variation  box  sizes  shown  and  then  thresholding  the  result. 
The  grey  areas  are  positive  variations  from  the  mean,  and  the  black  areas  are  the 
negative  variations  from  the  mean.  Figure  17  is  the  result  of  lambertization  and 
thresholding  the  same  input  image  except  it  had  been  reduced  from  640  x  480  pixels 
to  128  X  96  pixels. 

The  lambertized  images  vary  according  to  the  size  of  the  box  relative  to  the 
dimension  of  the  input  image,  but  the  eyes,  nose,  and  mouth  appear  consistently  as 
black  regions.  Since  these  images  are  local  variations,  the  lambertized  images  should 
be  consistent  over  a  large  range  of  lighting  conditions.  Therefore,  by  either  estimating 
the  face  size  in  the  input  image  or  lambertizing  the  image  with  several  different  boxes, 
or  both,  faces  may  be  found  by  developing  an  algorithm  which  searches  for  the  eyes, 
nose,  and  mouth  regions. 

Two  ways  this  algorithm  might  be  combined  with  the  motion  analysis  technique 

are: 

1.  Use  motion  analysis  to  segment  a  possible  face  region,  and  then  use  use  the 
lambertization  technique  to  find  facial  features.  This  may  better  discriminate 
faces  from  non-faces,  and  may  even  reveal  something  of  face  orientation. 

2.  Use  Lambertization  to  find  possible  faces,  and  then  use  the  motion  analysis  to 
discriminate  faces  from  non-faces. 
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INPUT  640  X  480  IMAGE 


9x9  BOX  VARIATIONS 


9x9  BOX  VARIATIONS  17x17  BOX  VARIATIONS  25  x  25  BOX  VARIATIONS 


Figure  17.  Reduced  Input  Lambertization  Images 
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The  advantages  of  the  first  method  are  it  may  be  quicker  and  provide  more  infor¬ 
mation.  The  advantage  of  the  second  method  is  it  may  find  faces  even  when  motion 
may  not  be  available. 
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V.  Conclusions 


This  thesis  demonstrates  that  motion  analysis  alone  may  provide  sufficient 
segmentation  for  a  face  recognition  system.  Even  though  the  motion  outlines  were 
not  as  consistent  as  expected,  the  system  consistently  found  the  heads  in  a  scene  and 
rescaled  these  heads  to  a  new  scale  which  was  fairly  consistent  across  all  subjects. 

This  segmentation  technique  carries  a  low  computational  burden  and  is  still 
relatively  quick.  As  such,  it  may  be  a  valuable  technique  to  fuse  with  other  seg¬ 
mentation  techniques,  such  as  searching  for  face  patterns,  to  (a)  reduce  their  search 
time,  and  (b)  increase  the  consistency  of  discriminating  face  regions. 

The  only  drawback  in  this  thesis  was  the  consistency  of  the  motion  images. 
Why  the  motion  outlines  were  not  consistently  defined  is  not  clear.  Objects  of  similar 
grey  scales  values  might  be  expected  to  obscure  some  boundaries,  but  this  does  not 
explain  the  results  shown  since  the  same  objects  and  background  produce  outlines 
most  of  the  time,  and  only  occasionally  do  not.  The  more  likely  explanation  is  that 
the  subjects  did  not  move  in  direction  of  the  plane  of  view. 

With  a  better  method  of  finding  the  motion  image,  the  system  could  improve 
the  scale  standardization  and  perhaps  even  its  ability  to  discriminate  head  regions 
from  non-head  regions.  Although  the  scale  of  the  segmented  faces  was  consistent, 
it  varied  slightly  depending  on  the  width  of  the  motion  region.  This  is  due  to  the 
fact  that  the  motion  outline  did  not  find  the  edge  of  the  head  exactly.  If  the  motion 
region  found  the  outline  of  the  head  exactly,  scale  would  be  consistent  for  each 
person,  and  all  background  could  be  eliminated  as  well.  Additionally,  with  a  better 
motion  image,  a  more  discriminating  analysis  could  be  conducted  on  the  motion 
image. 
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Appendix  A.  VideoPix 


A.l  General 

VideoPix  is  a  tool  which  provides  image  grabbing  and  manipulation  capability 
to  Sun  SPARCStations.  The  tool  consists  of  (a)  an  electronics  card  inserted  into 
the  SPARCStation  system,  and  (b)  “C”  software  routines.  There  is  a  users’  manual 
titled  Using  VideoPix  which  explains  how  to  install  the  system  and  describes  the 
software  routines. 

This  appendix  discusses  some  of  the  software  routines  and  some  of  the  problems 
associated  with  using  these  routines.  There  is  no  attempt  to  discuss  installation,  but 
to  discuss  the  software,  it  is  useful  to  have  some  understanding  of  some  hardware 
capabilities.  Appendix  B  has  code  which  use  these  routines;  this  code  may  be  useful 
as  examples. 

A. 2  Hardware 

The  input  video  signal  is  decoded  into  4:1:1  YUV  data.  This  translates  to 
two  bytes  per  pixel,  seven  bits  of  Y  (luminance)  per  pixel,  and  fourteen  bits  of  UV 
(chrominance)  per  four  pixels.  The  upper  byte  of  each  pixel  word  contains  the  seven 
Y  data  bits,  and  the  lower  byte  contains  either  two  or  four  of  the  fourteen  UV  data 
bits  and ’t  takes  four  pixel  words  to  describe  the  UV  for  any  one  pixel. 

The  YUV  image  data  is  also  based  on  non-square  pixels;  the  decoder  used  by 
the  tool  was  originally  designed  for  televisions.  Thus,  to  display  the  digitized  images 
correctly,  the  data  must  be  converted  to  square  pixel  data.  The  non-square  data  can 
be  displayed,  but  it  will  appear  distorted  in  the  horizontal  direction. 

When  an  image  is  digitized,  the  images  data  is  stored  in  two  large  FIFOs,  one 
for  each  field  of  the  video  frame.  Each  FIFO  read  increments  an  internal  pointer. 
Since  the  FIFO  memory  is  serial  access  only,  reading  a  specific  pixel  requires  that 
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all  pixels  up  to  that  location  must  be  read  as  well.  The  dimensions  of  the  digitized 
YUV  data  is  720  x  480. 

A. 3  Initialization 

The  key  VideoPix  initialization  software  routines  are: 

vfc_open()  Opens  the  hardware  and  locks  out  other  users. 
vfcjset_port()  Supposedly  sets  the  hardware  to  look  for  a  signal  on  the 
specified  port  (either  either  VFC_PORTl,  VFC_PORT2, 
or  VFC_SVIDEO).  This  routine  did  not  appear  to  work. 

If  only  one  signal  is  coming  into  the  Sun  SPARCSta- 
tion,  it  will  usually  find  that  signal  regardless  of  the  port 
specified  in  this  routine.  With  multiple  active  signals, 
the  tool  seems  to  default  to  port  1 .  The  only  consistent 
method  for  selecting  a  particular  port  seemed  to  be  invok¬ 
ing  the  VideoPix  vfctool  prior  to  executing  user  created 
software,  and  previewing  the  video  signal  on  the  desired 
port.  Then,  after  exiting  the  vfctool,  VideoPix  seems 
to  always  find  the  correct  port  when  executing  the  user 
defined  software. 

vfcjset_format()  Determines  the  format  type  from  the  incoming  signd 

An  example  of  user  software  module  which  uses  these  software  routines  can  be 
found  in  Appendix  B,  z.vfcjsetJiw.c. 

When  the  VideoPix  hardware  is  no  longer  needed,  the  vfc_destroy()  routine 
releases  the  hardware. 
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A.^  Grab  and  Convert 

The  key  VideoPix  routines  to  grab  image  data^  and  convert  that  data  to  8-bit 
grey  scale  or  rgb  color  are: 

vfc^rab()  Instructs  the  hardware  to  digitize  the  next  complete  frame.  This  puts 
the  data  into  the  FIFOs.  Invoking  vfc_yuvread_ntsc()  is  necessauy  to 
put  the  data  into  user  memory  where  it  can  be  manipulated. 
vfc_yuvread_ntsc()  Reads  in  digitized  YW  image  data  from  the  VideoPix  hardware  into 
a  memory  block  which  has  been  allocated  by  the  user.  vfc_grab()  and 
vfc-yuvread_ntsc()  are  usually  executed  together. 
vfc_yuv2y8jitsc()  Converts  the  non-square  pixel  YUV  data  into  square  pixel,  8-bit  grey 
scale  data.  Again,  the  user  must  allocate  memory  for  the  converted 
data  (remember  the  YUV  data  is  two  bytes  per  pixel,  the  8-bit  grey 
scale  is  one  byte  per  pixel).  A  point  the  user  manual  is  not  very  clear 
on  here  is  that  since  the  YUV  data  is  actually  7-bit  of  luminance,  the 
8-bit  data  is  created  by  multiplying  the  7-bit  data  by  2.  VideoPix  does 
this  multiplication  via  look-up  tables.  As  a  consequence,  vfcJnitJut() 
should  be  invoked  prior  to  executing  this  routine  to  initialize  the  look¬ 
up  tables.  According  to  Sun,  a  colormap  offset  of  zero  is  typical. 
vfc_yuv2rgb_ntsc()  Converts  the  non-square  pixel  YUV  data  into  NTSC  RGB  color  data. 

The  color  data  is  four  bytes  per  pixel,  and  the  data  for  each  pixel 
is  put  into  memory  XGBR.  The  upper  byte,  X,  which  is  evidently 
transparency  information,  was  not  used  in  this  research.  The  memory 
allocation  and  look-up  table  comments  given  in  the  vfc_yuv2y8_ntsc() 
discussion  apply  here  as  well. 

An  example  of  user  software  modules  which  use  these  software  routines  can  be 
found  in  Appendix  B,  z_grab_gra.c  and  z_grab_rgb.c. 

*A11  functions  which  are  format  specific  have  both  NTSC  and  PAL  versions;  the  only  format 
referred  in  this  appendix  will  be  NTSC,  but  it  appears  that  PAL  can  replace  NTSC  in  all  cases. 
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Appendix  B.  Source  Code 


This  appendix  contains  a  listing  of  some  of  the  source  code  used  in  performing 
this  thesis.  This  code  is  presented  as  is,  and  no  claims  are  made  as  to  suitability  for 
other  applications. 


B.l  segment2.c 
/* 

*  File:  aegment2.c 

*  Created:  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose: 

* 

*  Assumes: 

* 

*  Modified: 

*  By: 

*  Why: 

*/ 

^include  <stciio.h> 

^include  <sy8/(ypes.h> 

^include  "vfc_lib.h" 

#include  "globals.h" 

extern  struct  hejid_ptrs  *z_segnient(); 
extern  int  zjstoreJmage(); 

main() 

{ 

ujchar  *face,  *ptr; 

register  int  i; 

struct  head_ptrs  *temp4)tr,  *face_ptrs; 

char  tryagainflO]  ,filename[30]  ,command[80] ; 

int  count=l; 


sprintf ( t  ryagain , "  Xs  " , "  Y" ) ; 
while((tryagain[0]==  *  Y  ’ )  1 1  (tryagain[0]==  ’  y  ’ ) ) 

{ 

printf( "Please  look  at  camera  until  you  he^u:  a  beep.\n"); 
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face4>trs  =  (struct  head-ptrs  *)zjsegment(); 

system("echo  T"); 

if(f2K:e^trs  ==  NULL) 

{ 

printf( "Trouble  getting  images\n"); 
exit(l); 

} 

else 

{ 

while(face4)trs  NULL) 

{ 

sprintf(filenaine, 

"  %8%dy,s  " , "  f  ac  e  "  ,count , " .  sm" ) ; 
if(zjstoreJmage(face^trs— *head, filename, SMJSIZE)<0) 
fprintf(stderr, "Unable  to  write  to  file\n"); 
sprintf(command,"graytorle  -o  bold.rle  Ud  Xd  y,s", 
SM.WIDTH,SMJIEIGHT, filename); 
sprintf(filename, 

"y,ay,dy,s", "face", count, "  .rle"); 
sy8tem(command); 

sprintf(command,"rleflip  -v  -o  hold. rle", filename); 

system(command) ; 
sy8tem("rm  hold. rle"); 

sprintf(command,"xli  -quiet  -zoom  500  ‘/.s  fc",filename); 

system  ( command ) ; 

temp4)tr  =  face_ptrs; 

face_ptrs  =  temp.ptr— »next; 

free(temp4)tr— +head); 

free(temp4)tr); 

coimt++; 

} 

} 

printf("Would  you  like  to  try  again?  (Y  or  N)"); 

gets(tryagain); 

printf("\n"); 

} 


B.2  zjsegment.c 

J1^^L^^^^H^^^HHii^■^i^^^^^*^^*^^■¥*******■^^****^H^**^l^*i^************************^^***** 

*  File:  z^egment.c 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  This  code  is  a  set  of  commoii  routines  to  save  time/typing. 

*  This  code  grabs  all  potential  bead  regions  in  an  image, 

*  reduces  them  (SMJSIZE)  and  returns  these  reduced  bead  images 

*  in  a  structure;  a  nuii_ptr  is  returned  if  unsuccessful. 

* 

*  Assumes: 

* 

*  Modified: 

*  By: 

*  Why: 

f^indude  <stdio.h> 

^indude  <sy8/types.h> 

#indude  "vfc.lib.h" 

#indude  "globals.h" 

extern  struct  image4>trs  ♦zjnotion(); 
extern  Ujchar  *zjsegj:egions(); 

extern  struct  region  *zjoutline(); 

struct  head^trs  «zj3egment() 

{ 

Ujchar  *face; 

struct  region  *head ^regions,  *face_ptr; 
struct  image_ptrs  *im-ptrs; 

struct  head-ptrs  *head  Jist=0,  *temp-head,  ♦nuU_ptr=0; 


im4)trs  =  (struct  image-ptrs  *)z-motion(); 
if(im4Jtrs  ==  NULL) 

{ 

printf( "Trouble  getting  images\n"); 
return  null-ptr; 

} 

head j:egions=  (struct  region  *)z_outline(im4>trs— ►motion, 
VFC_NTSC-WIDTH,VFC-NTSCJ!EIGHT); 

if(head_region8  ==  NULL) 

printf("No  head  regions  found\n"); 
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else 


while(headj:egion8  ^  NULL) 

{ 

face=(ujchar  ♦)zjeg_regions(im4>trs— ►image2, 
headj-egions,VFC-NTSC.WIDTH, 
VFCJJTSC-HEIGHT,SM.WIDTH,SM-HEIGHT); 
if(face  ==  NULL) 

{ 

printf("face  ptr  is  null\n"); 

} 

else 

{ 

temp  Jiead=  (struct  head4>tr8  *) 
malloc(sizeof(struct  head^trs)); 
if(tempJiead==NULL) 

{ 

printf( "Problems  adding  to  list\n"); 
return  nail_ptr; 

} 

else 

{ 

tempJiead— ►next=headJist; 
temp  Jvead— ♦head  =  face; 
headJist  =  tempJiead; 

} 

} 

face-ptr  =  head_regions; 
head-regions  =  face_ptr— >next; 
free(face4)tr); 

} 

} 


free(im4)trs— »imagel); 
free(im.ptrs— »image2); 
free(im.ptrs— ♦motion); 
free(im.ptrs); 
free(head_regions) ; 


printf("z_segment  completeNnXn"); 
return  head.list; 

} 
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B.3  z-inotion.c 


f^^^**t^^^l****t**^ii^*^H^*^l*^i********************************************** 

*  File:  zjnotion.c 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  This  code  is  a  set  of  common  routines  to  save  time/typing. 

*  This  code  grabs  two  8- bit  grey  images  using  z^ab^a 

*  and  finds  the  difference  between  the  two  images  ( to  find 

*  out  what  was  moving)  using  zJind.diff. 

*  The  three  images  (both  8-bit  grey  and  the  difference) 

*  are  put  into  a  structure  which  is  returned;  a  nulLptr 

*  is  returned  if  unsuccessful. 

*  The  images  are  all  NTSCJ5IZE  (640x480)  and  Ibpp. 

*  The  vfc  hardware  is  set  upon  entry  and  released  prior 

*  to  departure  from  routine. 

* 

*  Assumes: 

4t 

*  Modified: 

*  By: 

*  Why: 

#include  <8tdio.h> 

#include  <sys/types.h> 

#indude  "vfc.lib.h" 

#include  "globals.h" 


extern  UJch^lr 
extern  ujchar 
extern  ujchar 
extern  int 


*z^et.vfc_hw(); 

♦z^ab^aO; 

♦zJfindjdiffO; 

zjnedianO; 


struct  image4>trs  *z_motion() 

{ 

ujchar  ’»image[2],  ♦motion,  ♦ptr; 

register  int  i; 

int  pixelsxhanged=0; 

struct  hw_controls  ♦hw.ptrs; 

struct  image-ptrs  ♦images,  ♦null-ptr=0; 


hw_ptrs  =  (struct  hwjcontrols  ♦)zjset_vfc_hw(); 
if(hw4)trs  ==  NULL) 
return  null4)tr; 
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A 

*  Create  a  difference  image  and  check  to  see  if  there  is  enough 

*  movement  to  exceed  threshold. 

V 

while(pixel8xhanged<MOTION-THRESHOLD) 

{ 

for(i=0;  i<2:  i++) 

{ 

image[i]=(ujchar  ♦)z-grab^a(hw_ptr8); 
if(image[i]  ==  NULL) 

{ 

printf("  image  ptr  is  nullVn."); 
return  null^tr; 

} 

if(i==l) 

{ 

motion=(uxhar  *)z_findjdifF(image(i— 1), 
image[i],  NTSC^IZE); 
if(motion  ==  NULL) 

{ 

printf("motion  ptr  is  null\n"); 
return  null_ptr; 

} 

} 

} 

for(i=l;  i<2;  i++) 

if(zjnedian(motion,VFCJ^TSC.WIDTH,VFC-NTSC-HEIGHT)<0) 
fprintf(stderr, "Median  filtering  error\n"); 
ptr  =  motion; 
pixelsjchanged  =  0; 
for(i=l;  i<(NTSC^IZE*2/3);  i++) 
if(*ptr++  ==  0) 
pixels  jchanged++; 

} 

images  =  (struct  image-ptrs  *)malloc(sizeof(struct  image_ptrs)); 
if  (images  ==  NULL) 

{ 

printf("Malloc  error"); 
return  null4>tr; 

} 

images— ♦imagel=image[0]; 
images— ►image2=image[l]; 
images— >motion=motion; 
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vfc  jdestroy  (hw4>tr8— >vfc  jdev) ; 
free(hw4)tr8); 

printf("z_motion  is  complete\n\n'’); 
return  images; 

} 
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B.4  z_set-vfc_hw.c 

/**♦♦♦*♦♦♦♦****♦*♦*♦****♦♦♦***♦**♦******♦*♦*♦***♦*************♦************ 

*  File:  z^et.vfcJiw() 

*  Created:  7  July  1992 

*  By:  Kevin  Gay 

* 

*  This  code  was  taken  from  an  example  wntten  by  Sun  which  came  with  the 

*  VideoPix  card.  The  example  was  hwjsetup.c  which  was  labelled 

*  Copyright  (c)  1990  by  Sun  Microsystems,  Inc. 

*  0ident  ”@(#)hwjietup.c  1.5  90/12/12  SMI” 

* 

*  Purpose:  The  code  is  intended  to  initialize  the  VideoPix  hardware. 

*  Returns  hardware  pointers  if  successful;  null.ptr  if  not. 

* 

*  Assumes:  Incoming  signal  is  NTSC  format;  PORT  set  in  global. h 

* 

*  ModiSed: 

*  By: 

*  Why: 


^include  <stdio.h> 
#include  <8ys/types.h> 
#include  "vfc.lib.h" 
#include  "globals.h" 


struct  hwjcontrols  *zjset.vfcJiw() 

{ 

int  rc,  format,  v_format,  wmjoffset; 

VfcDev  *vfc-dev; 

struct  hw.controls  ♦hw.ptrs,  *nulLptr=0; 


/* 

*  Open  the  hardware  just  use  the  default  case 

*  ” /dev/vfcO”  and  test  for  success. 

*  Lock  the  hardware  to  deny  other  programs  access 

V 

f*^i***  Open  the  vfc  hardware  and  set  a  software  lock  ******f 

if((vfcjdev  =  vfcjopen(NULL,  VFCXOCKDEV))  ==  NULL) 

{ 

fprintf(stderr, "Could  not  open  hardwareXn"); 
return  null4)tr; 

} 

/**  Set  the  input  signal  port  (PORT  defined  in  globals.h  **/ 
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vfcjet4)ort(vfcjdev,  PORT); 

/**♦♦  Determine  format  of  incoming  signal  and  lock  on  *****/ 

rc  =  vfcjset_format(vfcjdev,  VFC_AUTO,  &fonnat); 
if(rc  <  0) 

{ 

^rintf(8tderr,"No  video  signal  detected\n"); 
return  null^tr; 

} 

if(format  ==  NO_LOCK) 

{ 

fprintf(8tderr, "Unable  to  lock  onto  signalVn"); 
return  nuU^itr; 

} 

vTormat  =  vfc_videoTormat{format); 

if(vJonnat  96  VFC-NTSC) 

{ 

fprintf(stderr, "Warning  :  must  be  NTSC  formatXn"); 
return  null4)tr; 

} 

if(vTormat  ==  VFC_NTSC) 
vfcjwljust Jiue(vfcjdev,  0); 

Initialize  colormap  offset  and  look-up  tables  ♦****/ 

wmjoffset  =  0;  /*  Initialize  H/W  colormap  offset;  usu.  0  */ 
vfcJnitJut(wnij[)ffset);  /*  initialize  look-up  tables  */ 

/*:»*4i*4>4<**  Assign  harware  control  variables  **************/ 

hw4)trs= (struct  hw .controls  *)malloc(sizeof(struct  hw .controls)); 
if(hw4)trs  ==  NULL) 

{ 

fprintf(stderr,"malloc  hw_ptrs  errorXn"); 
exit(l); 

} 

hw4)trs— »vfcjdev  =  vfcjdev; 
hw4)trs— »colonnapjDffset  =  wmjoffset; 

printf("z_set_vf c_hw  complete\n\n"); 

return  hw.ptrs; 

} 
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B.5  z-grab-gra.c 

*  Fiie;  z-grab_gra.c 

*  Created:  3  June  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  allocate  memory  for  image  data, 

*  grab  YUV  image  data,  and  then 

*  convert  that  data  to  8-bit  grey,  square  pixel  data. 

*  The  code  returns  a  pointer  to  the  8  bit  grey  data  if 

*  successful,  or  a  NULL  if  an  error  has  occurred. 

* 

*  Assumes:  z^et.vfcJiw.c  has  been  executed. 

*  YUV  data  is  2  bytes  per  pixel  (bpp),  8  bit  gray  1  bpp. 

* 

*  Modified: 

*  By: 

*  Why: 

#include  <stdio.h> 

#include  <3ys/types.h> 

#include  "vfc.lib.h" 

#include  "globals.h" 

u-char  ★z_grab-gra(hw_ptrs) 
struct  hwjcontrols  *hw_ptrs; 


{ 

ujchar  *yuv_data,  *im_data,  *mein_ptr,  *null_ptr  =  NULL; 

register  int  i; 

A 

♦  Allocate  space  for  images. 

V 

yuvjdata  =  (ujchar  *)malloc(YUV_SIZE); 

if(yuvjdata  ==  NULL)  { 
peiTor("malloc"); 
return  null_ptr; 

} 

imjdata  =  (u_char  ♦)malloc(NTSCJSIZE); 

if(im Jata  ==  NULL)  { 
perror  ( "  mal  1  o  c  " ) ; 
free(yuvjda*a); 
return  nul]_ptr; 
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} 


/*  printf(”YUV  and  8-bit  image  memories  created\n”); 

*1 

A 

*  Do  a  grab  and  read  in  an  image 

V 

if(vfc^ab(hw4)trs— ►vfcjdev)  <  0)  { 

A 

♦  If  the  grab  fails  the  image  read  will 

*  be  garbage. 

*1 

fprintf(stderT,"WaLrning,  grab  failedXn"); 

free(3ruvjdata); 

free(imjdata); 

return  null4>tr; 

} 

if(vfc_yuvread-sq(hw4>trs— ♦vfcjdev,  yuv_data,  VFC_NTSC)  <  0)  { 
fprintf(atderr, "Warning,  vfc_yuvread  failedXn"); 
free(yuvjdata); 
free(imjdata); 
return  null4)tr; 

/*  printf(”YUV  image  grabbed  and  read  from  the  hardware\n”); 

*/ 


A 

♦  Convert  the  YUV  data  into  8-bit  gray  sqmure  pixel  data. 

*/ 

if  (hw4>trs— >colormapjofFset  ^  0) 

vfcjnitJut(hw4)tr3— »colormap-Dffset); 
vfc-yuv2y8jQtsc(yuvjdata,  inudata); 

/+  printf(”YUV  data  converted  to  8-bit  gray  data\n”); 

*/ 

free(yuvjdata); 

printf("z_grab_gra  completeXnXn"); 
return  inuiata; 

} 
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B.6  zJind_dilf.c 


*  File:  z.Bndjdiff.c 

*  Created:  9  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  allocate  memory  and  store 

*  the  pixel  by  pixel  difference  between  two  images. 

*  The  resulting  image  is  binarized  (255  or  0)  depending 

*  whether  the  aJfference  exceeds  the  DIFFERENCE-THRESHOLD. 

*  Difference  pixels  exceeding  the  threshold  are  0  - 

*  255  is  white,  0  is  black;  difference  is  0  to  avoid 

*  a  toner  test  when  printing  out  difference  images. 

*  The  code  return  a  pointer  to  the  difference  data;  the 

*  pointer  will  point  to  NULL  if  an  error  has  occurred. 

* 

*  Assumes:  Both  input  images  are  8  bit  grey  image  data  (1  byte/pixel) 

*  and  the  same  size  (difference  data  is  Ibpp  and  same  size 

*  as  well). 

* 

*  Modified: 

*  By: 

*  Why: 

*^Hm^*1t**t*****************************1i***0*****************ltl*****^i^l**:^*t/ 

#include  <stdio.h> 

#include  <sys/type8.h> 

^include  "vfc.lib.h" 

^include  "globals.h" 

#define  DIFFERENCE-THRESHOLD  10 

ujchar  *z.findjdiff(imljdata,  im2.data,  size) 
u-char  «imljdata,  %im2-data', 
int  size; 

{ 

ujchar  *iml_ptr,  *im2-ptr,  fdiff-data,  *diff_ptr, 

♦nulLptr  =  0; 
register  int  i; 

diffjdata  =  (ujchar  *)malloc(size); 
if(diffjdata  ==  NULL)  { 
perror("malloc"); 
return  nulLptr; 

} 

iml-ptr  =  iml-data; 
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im2-ptr  =  im2jdata; 
diff^tr  =  diffjiata; 
for(i=0;  i<(size);  i++) 

{ 

if((ab8(*iml-ptr++  -  *im2^tr++)-DIFFERENCE.THRESHOLD)>0) 
♦diff_ptr+4-  =  0; 
else 

*difF^tr++  =  255; 

} 

printf("z_fiiid_diff  complete\n\n"); 
return  diffulata; 

} 
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B.  7  z-median.c 


*  File:  zjnedian.c 

*  Created:  10  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  median  Biter  an  input  image. 

*  The  code  creates  a  temp  memory,  and  Biters  as  follows: 

*  -  check  if  1st  row,  last  row,  or  other 

*  -  check  if  1st  column,  last  column,  or  other 

*  -  set  pixel  to  majority  value  of  pixels  which 

*  ’’surround”  that  pixel. 

*  Returns  1  if  successful,  -1  if  not. 

* 

*  Assumes:  The  input  image  data  is  binarized  (either  255  or  0) 

*  8  bit  grey  image  data  (1  byte/pixel). 

* 

*  Modified; 

*  By: 

*  Why: 

***^t^t*^i*****************^tt***********************************************/ 

#include  <stdio.h> 

#include  <sys/types.h> 

#include  "vfc.lib.h" 

#include  "globals.h" 

int  z-median(imjdata,  width,  height) 
u-char  ♦im.data; 
int  width,  height; 

{ 

uxhar  *medjdata,  *iml-ptr,  ♦im24)tr, 
register  int  i,  j; 
int  sum; 

medjdata  =  (uJch^^:  *)malloc(width*height); 
if(medjdata  ==  NULL)  { 
perror  ( "malloc  " ) ; 
return  —1; 

} 

iml4)tr  =  imjdata; 
im2_ptr  =  medjdata; 
for(i=l;  i<height;  i++) 

{  /♦  Since  0  is  ”on”  */ 

for(j=l;  j<width;  j++)  /*  let  ties  goes  to  */ 

{  /*  0  (best  5  of  9)  */ 
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if(i==l)  /*♦*♦♦**♦******♦*♦***/ 

{ 

/****  First  Row  ****/  if(j==l) 

/*  -  first  column  */  sum  =  ♦iml^tr4-+  +  ♦(iml4)tr+l)+*(iml4)tr+width) 
/*  -  last  column  */  +*(iml-ptr+width+l)  +  255  +  255; 

/*  -  other  columns  */  else  if(j==:width) 

s\un  =  *iml4)tr++  +  *(iml4)tr— 1)  + 

*(iml4)tr+width)  +  *(iml-ptr+width— 1)  + 

255  +  255; 
else 

sum  =  *iml-ptr++  +  *(iml_ptr-l)  +  255  + 

*(iml_ptr+width)  +  *(iml^tr+width— 1)  + 
*(iml4)tr+widtli+l)+*(iml4)tr+l); 

} 

else  if(i==height) 

{ 

/****  Last  Row  *****/  if(j==zl) 

/*  -  first  column  */  sum  =  *iml-ptr4-+  +  *(iml4)tr+l)  + 

/*  -  last  column  */  *(iml-ptr-width)  +  *(iml-ptr-width+l)  + 

/*  -  other  columns  */  255  +  255; 

else  if(j==width) 

sum  =  ♦iml-ptr++  4-  *(iml-ptr-l)  + 

♦(iml4)tr— width)  +  *(iml4>tr— width— 1)  + 

255  +  255; 
else 

sum  =  ♦iml_ptr++  +*(iml-ptr-l)+*(iml^tr+l)+ 
♦(iml^tr-width)  +  *(iml_ptr-width-l)  + 

♦  (iml^tr— width+1)  +  255; 

} 

else 

{ 

/***  Other  Rows  ****/  if(j=:=l) 

/*  -  first  column  */  sum  =  ♦iml,ptr++  +  ♦(iml^tr+1)  +  255  + 

/*  -  last  column  */  *(iml-ptr-width)  +  *(iml4)tr-width+l)  + 

/*  -  other  columns  */  *(iml4)tr+width)  +  *(iml-ptr+width+l); 
else  if(j==width) 

sum  =  ♦iml-ptr+4-  +  *(iml-ptr-l)  +  255  + 

*(iml4>tr-width)  +  ♦(iml^tr-width-1)  + 

*(iml4)tr4-width)  +  *(iml_ptr+width-l); 
else 

sum  =  ♦iml-ptr4-+  +*(iml4)tr— l)^-*(iml-ptr^-l)+ 

♦(iml4)tr-width)  +  *(iml^tr-width-l)  + 

*(iml4)tr-width+l)  +  ♦(iml.ptr+width)  + 

*(iml^tr+width-l)  +  ♦(iml_ptr+width+l); 

} 

if(sum>(5*255)) 

*im2_ptr+-)-  =  255; 
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else 

*im2^tr++  =  0; 

} 

} 

iml^tr  =  imjdata; 
im24>tr  =  medjdata; 
for(i=0;  i<(height*width);  i++) 
*iml^tr++  =  *im2_ptr++; 
free(medjdata); 

printf("z_median  complete\n\n"); 
return  1; 

} 
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B.8  z-outline.c 


f^i^**^L^l*^i^^^i:^^^^**:^^^^,*^l^H^^l^t^it*^^***t*:ti^m********************************* 

*  File:  z-outline.c 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  This  code  makes  an  outline  image  of  moving  regions  by 

*  looking  for  the  first  motion  pixel  from  the  top  in  each 

*  coJunui.  Some  grouping  is  done  to  dean  up  the  outline 

*  and  some  spikes  are  removed.  Then  possible  head  regions 

*  are  found  by  finding  positive-zero-negative  slope  regions. 

*  The  code  to  create  a  outJine  image  remains,  but  it  is 

*  commented  out. 

* 

*  Assumes; 

* 

*  Modified: 

*  By  : 

*  Why: 

#include  <stdio.h> 

#include  <8ys/types.h> 

#include  "vlc.lib.h” 

#include  "globals.h" 

#deane  MAX.WIDTH  VFCJ^TSC.WIDTH  /**  640  **/ 

#deane  MAXJIEIGHT  VFC-NTSC  JIEIGHT  /**  480  *♦/ 

#define  GROUPING  10 
#define  MIN  JACE.WIDTH  100 

extern  ujchar  ♦z_8toreJmage(); 

struct  region  *zJ)utline(motion,  width,  height) 
ujchar  ♦motion; 
int  width,  height; 

{ 

register  int  i,  j,  slope_threshold; 

int  top[MAX-WIDTH],  shorttopfMAX.WIDTH/GROUPING], 
topjlope[(MAX_WIDTH/GROUPING)-l), 
found,  leaveJoop,  repeat,  not  smaller, 
r_col,  Lcol,  temp,  remainder; 
ujchar  ♦im^itr,  ♦outline; 
char  newname[30]; 

struct  region  ♦ptr,  ♦ptr2,  ♦poss_heads=0; 
static  int  cnt=0; 
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if  (width>MAX_WIDTH) 

{ 

printf("Too  vide  for  routineXn"); 
return  poss  Jieads; 

} 

if  (height>MAX_HEIGHT) 

{ 

printf("Too  high  for  routineXn"); 
return  possJieads; 

} 

/****«:4i  Find  Srst  motion  pixels  in  each  column  (from  top) 

(skip  fast  ten  rows  in  case  of  border  and 

skip  Brst/last  ten  columns  in  case  of  border)  ♦*♦****♦/ 

im^tr  =  motion; 
for(i=0;  i<10;  i++) 
top[i]  =  height; 
for(i=10;  i<(width— 10);  i++) 

{ 

j=ii; 

im4)tr  =  motion+(10*width)+i; 
found  =  0; 

while((found==0)&&(j<height)) 

{ 

if(*im4)tr  ==  0) 

{ 

top[i]  =  j; 
found  =  1; 

} 

else  if  (j++  ==  height) 
top[i]  =  height; 
im-ptr+=(width); 

} 

} 

for(i=(width— 10);  i<width;  i++) 
top[i]  =  height; 

/*♦*♦♦**♦♦  Clean  up  top  edges  by  grouping  columns  ♦♦***♦♦♦****/ 

for(i=0;  i<{width/GROUPING);  i++) 

{ 

shorttop{i]  =  height; 
for(j=0;  j<GROUPING;  j4-+) 

if(top((GROUPING>»i)+j)<shorttop(i)) 
shorttop[i]  =  top[(GROUPING*i)+j]; 
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} 


/♦*****♦♦*♦*♦♦♦******  Piajj  slopes  from  top  *****************1^^^/ 

for(i=0;  i<((width/GROUPING)-l);  i++) 

topjlope[i]=((shorttop[i+l]-shorttop[i])/(GROUPING*2)*(-l)); 
remainder  =  ab8((8horttop(i+l]-8horttop[i])%(GROUPING*2)): 
if(  (GROUPING— remainder)  >0) 
if(8horttop[i+l]  >8horttop{i] ) 
topjslope[i] — ; 
ebe 

topjslope[i]++; 

} 

/**«*«***:)<****  Eliminate  some  spikes  ♦♦*♦*♦♦♦♦♦*♦*♦**♦**♦*♦***/ 

for(i=0;  i<((width/GROUPING)-2);  i++) 
if(ab8(  top  jslopefi] )  >  1 ) 

{ 

if((ab8(top^lope[i]+top^lope[i+l])<top-slope[i])&& 

(ab8(topjslope[i+l])>l)) 

{ 

shorttop(i+lj  =  8horttop(i]-, 
top-8lope[i]  =  0; 

topjlope[i+l]  =  ((8horttop[i+2]  — 

^  shorttop[i+l])/(GROUPING*2)*(-l)); 

else  jf((abs(topjlope[i]+topjlope[i-l])<top^lope[i)) 
&&(i>0)&&(abs(topjslope[i— 1])>1)) 

shorttop(i]  =  shorttop[i-l]; 
topjslope[i— 1]  =  0; 
topjslopeii]  =  ((shorttop(i+l]  - 

8horttop[i])/(GROUPING*2)*(-l)); 

} 

A***********  Find  potential  faces  from  the  top  *♦♦♦♦*♦****♦**/ 
for(slopeJ;hre8hold=l;  slope_threshold<3;  slope_threshold++) 

1j:o1=0; 

while(Ixol<  ( ( width/GROU  PING )  -  3) ) 

{ 

found  =  0; 

if ( top  jslope  [1  jcoI]  >  slope -threshold ) 

{ 
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rjcol=ljcol+l; 

leaveJoop=0; 

while(leaveJoop==0) 

{ 

if(ab8(top^lope[rjcol])<slope_threshold) 

/*  Beginning  at  leftmost  ♦/  rjcol++; 

/*  column,  find  6rst  col*/  else  if(topjslope[rjcol]>slopeJ;hreshold) 

/*  with  pos  slope  (Lcol)*/  { 

/*  above  threshold.  */  Lcol=rjcol; 

/*  While  searching  for  */  rjcol+4'; 

/*  neg  slope  above  thres-*/  } 

/*  hold  (rjcol),  update  *f  else  /*topMope[rjcol]<((-l)*slopeJ,hreshold)*/ 
I*  lu:ol  w/each  pos  slope*/  { 

/*  above  threshold  */  temp=ljcol— 1; 

/««*«:(c«*:^4i***i^4‘*«********«/  not  jmaller=l; 

while((temp>0)&&(notjmaller==l)) 

{ 

if(topjlope[temp]>top^lope[ljcol]) 

{ 

ljcol=temp; 
temp — ; 

} 

else 

notjsmaiier=0; 

/*  Once  col  w/neg  slope  */  } 

/*  is  found  (r.col),  push*/  temp=r-col+l; 

/*  Ijcol  back  until  slope*/  while((temp<((width/GROUPING)— 1)) 

/*  is  decreasing,  and  */  && (leave Joop==0)) 

/*  push  rjcol  forward  */  { 

/*  until  slope  increase  */  if(top^lope[temp]<topjslope[rxol]) 

/***:t**Jt:Ht**t*************/  { 

r-col=temp; 

temp++; 

} 

else 

leaveJoop=l; 

} 

if(((rjcol-lxol)*GROUPING) 

>MIN_FACE.WIDTH) 

/**ir^***t***********************/  ( 

/*  If  pos-neg  slope  region  is  */  ptr= (struct  region  *) 

/*  found,  check  for  duplicate  */  raalloc(sizeof(struct  region)); 

/*  regions.  If  not  duplicate,  */  ptr— ►x=(1jco1*GROUPING)+1; 

/*  then  add  to  linked  list  of  */  ptr— *width=(rxol+2)* 

/*  possible  head  regions  */  GROUPING— ptr— ►x; 

ptr— *y  =  shorttop[rxol]; 
for(i=lxol;  i<rxol;  i++) 
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if(ptr— ►y  >  shorttopfi]) 
ptr— *y  =  shorttopfi]; 
if ( (ptr— ►y+ptr— >  width*3 /2)  >height ) 
ptr— ►height= (height — ptr— >y) ; 
else 

ptr — ♦height=ptr — ►width*3/2; 
repeat=0; 
ptr2=possJieads; 
while(ptr2#NULL) 

{ 

if((ptr— ►x==ptr2— 

(ptr-»y==ptr2— ►y)&& 

(ptr— »width==ptr2—>  width)) 
repeat=l; 
ptr2=ptr2— »next; 

} 

if(repeat==0) 

{ 

ptr— ♦next=po8sJieads; 
poss-heads  =  ptr; 

} 

else 

{ 

free(ptr); 

} 

} 

leave  Joop  =  1; 
ljCol=rjCol; 

} 

if(rjcol>((width/GROUPING)-l)) 

{ 

IjcoI  =  r-col; 
leave  Joop  =  1; 

} 

} 

} 

else 

Lcol++; 

} 

} 

Create  an  image  of  the  motion  outline  *♦♦***♦**♦/ 

cnt-h-h; 

outline  =  (u.char  *)malloc(width*height); 
if(outline  ==  NULL) 


{ 

perror(”maUoc”); 

return; 

} 

im^tr  =  outline; 
for(i=0;  i<widtb;  i++) 

{ 

im-ptr— outline; 
ina^tr+—i; 

for(j=l;  j<sborttop(i/10j;  j++) 

{ 

*im-ptr  =  255; 
im4)tr+=(  width  ); 

} 

for(j=l;  j<((beigbt-sborttop[i/10})+l);  j++) 

{ 

*im-ptr  =  0; 
im-ptr+=(  width  ); 

} 

} 

sprintf(newname,”%s%d%s”,”outline”,cnt,”.gra”); 
if(zj3toreJmage(outline,  newname,  widtb*height)  <  0) 
fprintf(8tderr,” Unable  to  write  %s  to  Rle\n”, newname) 

free(outline); 

*1 

printf(”z_outline  complete\n\n"); 
return  possJieads; 

} 
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B.9  zjseg-regions.c 


*  File:  zseg-regions.c 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  This  routine  takes  in  an  image  and  a  region  structure 

*  which  speciSes  the  rectangular  region  to  be  segmented 

*  from  an  image. 

*  The  segmented  portion  is  then  reduced  to  some  specified 

*  dimension  and  returned;  null  ptr  is  returned  if  error. 

*  The  code  for  viewjace  is  left  in,  but  commented  out. 

*  The  code  for  viewJace  will  create  an  image  the  same 

*  dimensions  as  the  input  image  with  only  those  pixels 

*  to  be  segmented  assigned  grey  scale  values.  The 

*  remaining  pixels  are  white  (255). 

* 

*  Assumes: 

* 

*  Modified; 

*  By  : 

*  Why: 

^^,^^^^^,t^l*l^^,^^^l^l^tiH,^,t*****************************************************/ 


#include  <stclio.h> 

#include  <sys/types.h> 

#include  "vlc.lib.h" 

#indude  "globals.b" 

extern  ujchar  *z_reduce(); 
extern  iot  z  jtore  image  (); 

ujchar  ♦z_segjregions(image,  head,  width,  height,  seg.width,  segJieight) 

uxhar  ♦image; 

struct  region  ♦head; 

int  seg.width,  segJieight; 

{ 

u.char  ♦ptr,  ♦ptr2,  ♦face,  ♦segJace,  ♦viewjace, 

♦nulLptr=0; 
char  newname[30]; 
register  int  i,  j; 

static  int  cnt=0;  /♦  Allows  multiple  calls  with  unique  ♦/ 

/♦  filenames.  ♦/ 

f^,******  Segment  the  potential  face  from  the  image  ♦♦♦♦♦♦♦♦♦♦/ 

face  =  (ujchar  ♦)maIloc(head— >width^head-+height); 
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if(face  ==  NULL) 

{ 

perror  ( "malloc  " ) ; 
return  null^itr; 

} 

ptr  =  image; 
ptr2  =  face; 

ptr+=(((head— ►y— l)*width)+(head— »x— 1)); 
for(j=0;  j<head— ►height;  j++) 

{ 

for(i=0;  i< (head— ►width);  i++) 

*ptr2++  =  *ptr++; 
ptr  4-= (width — head— ►width ) ; 

} 

seg_face=(ujchar  *)z_reduce(face,head— ►width, head— ►height, 
seg_width,seg  Jieight) ; 
if(3eg_face  ==  NULL) 

{ 

printf("aeg_face  ptr  is  nullXn”); 
return  null-ptr; 

} 

/ii*^,*^^i***>n^^^i****^i***  Create  viewJ'ace  ♦***♦**♦♦****♦*♦♦*♦*/ 

/*  cnt++; 

viewJ'Bu:e  —  (u.char  *)imdloc(width*height); 
i{(viewJace  ==  NULL) 

{ 

perror  ("malloc”); 
return  null^tr; 

} 

ptr2  =  viewJ’ace; 

for(i=0;  i<(width*height);  i++) 

*ptr2-h+  =  255; 
ptr  =  image; 
ptr2  =  viewJace; 

ptr-h=(  (  (head->y-l  )*width)+(bead->x-l)); 
ptr2d-=(  (  (head~>y-l  )*width)+(head->x-l)); 
for(j=0;  j<head-> height;  j+-f-) 

{ 

for(i=0;  i<head->  width;  i-h-h) 

*ptr2+-h  =  *ptrd-d-; 
ptr+=(  width-head->  width); 
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ptr2-h=(  widtb-bead->  mdtb); 

} 

8printf(newname,  ”%s%d%s”,  ”vJew”,cnt,  ”.gr&”); 
if(zj3toreJmage(viewJ‘ace,  newname,  widtb*beigbt)  <  0) 
fpriatf(stderr,”  Unable  to  write  %s  to  £le\n”,newnanie); 

free(viev.  fitce); 

Ftee  memory  and  return  image  ♦*♦*♦♦*♦♦**♦*******♦/ 

free(face); 

printf("z_3eg_regions  complete\n\n'*); 
return  seg_face; 


B.IO  z_reduce.c 


*  File:  z-reduce.c 

*  Created:  13  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  reduce  an  input  image,  of  some 

*  arbitrary  height  and  width,  to  a  reduced  height 

*  and  width,  also  an  input. 

*  The  reduction  algorithm  simply  skips  bits. 

*  The  algorithm  skips  more  bits  on  the  side  of  the  image 

*  to  be  reduced  in  cases  where  the  width  division  is  not  even, 

*  and  more  bits  are  skipped  on  the  bottom  of  the  input  image 

*  when  the  height  division  is  not  even. 

*  Returns  a  pointer  to  the  memory  block  containing  the 

*  reduced  image  data  upon  success;  a  null  ptr  if  not. 

* 

*  Assumes:  The  input  image  data  is  8  bit  grey  image  data  (1  byte/pixel). 

♦ 

*  Modified: 

*  By: 

*  Why: 

**iHi*iHi****iH,*^lii:H,iLit,t*t***i,***ii:i,*****i,*:t,:^^,ic*1,^,4c^c:t‘^i*t********************/ 

#include  <8tdio.h> 

#include  <sys/types.h> 

#include  "vfc.lib.h" 

#include  "globals.h" 

u.char  *Z-reduce(imjdata,  old.width,  oldJit,  new  .width,  new  Jit) 
uj:har  *im-data; 

int  old.width,  oldJit,  new.width,  new  Jit; 

{ 

u.char  *8mjdata,  *ptrl,  +ptr2,  *nuILptr  =  0; 
regi8ter  int  i,  j; 

int  widthjdiv,  width.adj,  left.adj,  right.adj,  ht.div, 
bottom.adj; 

smjdata  =  (u_char  *)malloc(new_width*newJit); 
if(8mjdata  ==  NULL)  { 
perror("malloc"); 
return  nulLptr; 

} 

if  ( (old.width  <new  .width)  |  j  ( old  Jit  <new  Jit) ) 

{ 

fprintf(8tderr,"Too  small  to  reduce. \n"); 
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return  null4>tr; 

} 

widthjdiv  =  old.width/new.width; 
width-adj  =  old_width%new_width; 
left^j  =  width  jadj/2; 

right_adj  =  new_width—left_adj -(width  julj%2); 
htjdiv  =  ((old_ht/new_ht)— 1); 
bottom_adj  =  newJit— (old_ht%new_ht); 

ptrl  =  smjdata; 
ptr2  =  imjdata; 
ptr2+= (width  jdiv/2) ; 
for(j=0;  j<new_ht;  j++) 

{ 

for(i=0;  i<new_width;  i++) 

{ 

*ptrl++  =  *ptr2; 
ptr2+=widthjdiv; 
if((i<left^j)||(i>right^j)) 
ptr2++; 

} 

ptr2+=(old_width*htjdiv); 

if(j>bottom-adj) 

ptr2+=old_width; 

} 

printf("z_re<iuce  complete\n\n"); 
return  smjdata; 

} 


62 


B.ll  z_storeJmage.c 


/♦**♦*♦♦******♦****♦***♦*****♦**♦♦*♦****♦♦♦♦♦*♦♦***♦******♦♦♦******♦***♦* 

*  File:  z^toreJmage.c 

*  Created:  8  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  store  image  data  in  a  Sle 

*  which  is  labelled  <Blename>  which  is  pass  in. 

*  Retinms  a  1  is  successful;  -1  if  not. 

* 

*  Assumes: 

* 

*  ModiBed: 


*  By  : 

*  Why: 

**********#***********>(‘************************=t‘***********************/ 


#include  <atdio.h> 

^include  <8y3/types.h> 

#include  "vfc.lib.h" 

#include  "globals.h" 

int  zj8toreJmage(image4>tr,  filename,  size) 
uxhar  *image4)tr; 
char  ♦filename; 
int  size; 


{ 

FILE  ♦datJile; 

/* 

♦  Open  image  Bles  for  writing  images 

♦  into  memory;  check  to  make  sure  they  open  okay. 
*/ 

if  ((dat_file  =  fopen(fiIename,"w"))  ==  NULL) 

{ 

printf("I  can't  open  the  '/.s  file\n", filename); 
return  —1; 

} 

/♦  print f(” Opened  %s  Ble\n” ,Blename); 

V 

I* 

♦  Put  the  data  into  a  Ble;  close  Ble. 

*/ 

if  (fwrite(image4)tr,  1,  size,  datJile)  ==  0){ 
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perror("write  failed"); 
return  —1;; 


} 

fflush(stdout); 

fclose(dat-file); 

printf ( " z_8t  or e  _ image  complet  e \n\n" ) ; 
return  1; 

} 
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B.12  globals.h 

A 

*  File:  globads.h 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  Put  all  global  variables  and  deSnitions  in  one  place. 

* 

*  Assumes:  vfcJib.h  is  also  included  -  all  the  vfc  routines  and 

*  vfc  deSnitions  are  in  vfcJib.b. 

* 

*  ModiSed: 

*  By: 

*  Why: 

*1 

#define  PORT  "VFC.SVIDEO" 

#define  YUV^IZE  VFCJJTSCJHEIGHT*VFC_YUV-WIDTH*2  J*  720  x  480  x  2hpp 
*1 

#define  NTSC-SIZE  VFC_NTSC.WIDTH*VFCJJTSCJIEIGHT  /♦  640  x  480  */ 

#define  RED.WIDTH  128 

#define  RED  JIEIGHT  96 

#define  RED^IZE  RED.WIDTH*RED-HEIGHT 

#define  SM.WIDTH  32 

#define  SM  JIEIGHT  32 

#define  SMJSIZE  SM.WIDTH*SM-HEIGHT 

#define  MOTION.THRESHOLD  3000 

struct  region 

{ 

int  x; 

int  y; 

int  width; 

int  height; 

struct  region  *next; 

}; 


struct  hwxontrols 

{ 

VfcDev  ♦vfc.dev; 
int  colormapjoffset; 
}; 


struct  image4)trs 

{ 

ujchar  ♦imagel; 

ujchar  ♦iniage2; 


65 


ujchar 

}: 


♦motion; 


struct  head^trs 

{ 

ujchar  ♦head; 

struct  head4>tr8  ♦next; 

}; 
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B.13  grab.c 

A 

*  File:  grab.c 

*  Created:  8  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  take  in  images  in  a  loop. 

*  Each  pass  thru  the  loop  takes  in  YUV  image  data, 

*  converts  the  data  to  8- bit  grey,  square  pixel  data,  and 

*  save  the  8-bit  data  in  a  £le  labelled  with  person’s  name. 

*  The  number  of  images  (up  to  ten)  and  the  root  of  the 

*  Blenames  are  entered  during  execution. 

* 

*  Assumes: 

* 

*  Modified: 

*  By: 

*  Why: 

*/ 

#include  <stdio.h> 

#include  <sys/(ypes.h> 

#include  "vfc_lib.li" 

#include  "globals.h" 

extern  VfcDev  *zjset.vfc_hw(); 
extern  uxhar  *z.grab.gra(); 
extern  int  zjstoreJmage(); 

void  grab(); 

main() 

{ 

char  fiie[30]; 
int  niunimages; 

printfC’Enter  the  filename  root:  "); 

gets(file); 

printf("\n"); 

printf("Enter  the  number  of  images  (10  max):  "); 

scanf ("  %d  ", &num  images ) ; 

printf("\n"); 

grab(file,  numJmages); 
exit(O); 

} 
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void 

grab(naine,  niimJoop) 
char  name[20]; 
int  numJoop; 

{ 

ujchar  *grey[10]; 

register  int  j; 
struct  hwjcontrols  ♦hw.ptrs; 
char  filename[30]; 

hw4)trs  =  (struct  hwjcontrols  *)z-set_vfc_hw(); 

if(numJoop>10) 

{ 

numJoop  =  10; 

printf( “Sorry,  limited  to  10  images\n"); 

} 

for(j=l;  j<numJoop;  j++) 

{ 

greyp]=(ujchar  *)z^ab^a(hw4)trs); 
if(grey[j]  ==  NULL) 

{ 

printf("ptr  is  null\n"); 
exit(l); 

} 

} 

A 

♦  Create  name  for  image  data  and  store  in  file. 

V 

forG=l;  j<n\imJoop;  j++) 

{ 

sprintf(filename,"y.sy,dy,s",name  j," .  gra"); 
if(zjtoreJmage(greyp],  filename,  NTSCJSIZE)  <  0) 

^rintf(stderr, "Unable  to  write  Xs  to  f ile\n",filen^^ne); 

} 

I* 

*  Now  destroy  the  hardware  and  free  the  memory. 

*1 

vfcjdestroy(hw4)trs— ►vfcjdev); 

free(hw4)trs); 

for(j=l;  j<numJoop;  j++) 
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free(grey[j]); 


return; 
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B.  14  grabrgb.c 

I* 

*  File:  grabrgb.c 

*  Created:  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  take  in  images  in  a  loop  num  Joop 

*  times.  Each  pass  thru  the  loop  takes  in  YUV  image  data, 

*  converts  the  data  to  8-bit  color,  square  pixel  data,  and 

*  save  the  8-bit  data  in  a  61e  labelled  with  person's  name. 

*  Number  of  loops  (num  Joop)  and  person's  name  (person) 

*  are  entered  during  execution. 

* 

*  Assumes:  Signal  coming  in  S- Video  port,  NTSC  format 

* 

*  ModiSed: 

*  By: 

*  Why: 

V 

#include  <stdio.h> 

#include  <sy8/types.h> 

#include  "vfc.lib.h" 

extern  VfcDev  *zjset.vfc_hw(); 
extern  u_char  *z^abj-gb(); 
extern  int  zj9toreJmage(); 

extern  int  ntscjsize; 

void  grabrgbO; 

main() 

{ 

char  file[30]; 
int  numJmages; 

printf("Enter  the  filename  root:  "); 

gets(file); 

printf("\n"); 

printf("Enter  the  number  of  images  (10  max):  "); 

scanf(  "%d"  ,&num  images) ; 

printf("\n"); 

grabrgb(file,  numimages); 
exit(O); 
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} 


void 

grabrgb(nanie,  numJoop) 
char  name[20]; 
int  numJoop; 

{ 

ujchcu:  *color[10]; 

register  int  j; 

VfcDev  *vfc_dev; 

char  filename[30]; 

vfcjdev  =  (VfcDev  ♦)zjset.vfc_hw(); 

if(num  Joop>  10) 

{ 

numJoop  =  10; 

printf( "Sorry,  limited  to  10  imageaXn"); 

} 

for(j=l;  j<numJoop;  j++) 

{ 

colorp]=(ujchar  ♦)z_grabj:gb(vfcjdev); 
if(color[j]  ==  NULL) 

{ 

printf("ptr  is  null\n"); 
exit(l); 

} 

} 

A 

♦  Create  name  for  image  data  and  store  in  file. 

V 

for(j=l;  j<numJoop;  j-»-+) 

{ 

8printf(filenaune,  "XsXdXs  "  ,name  j," .  rgb"); 
if(zjstoreJmage(coIorp],  filename,  (ntsc-size*4))  <  0) 

fprintf(stderr, "Unable  to  write  7,s  to  file\n", filename); 

} 

A 

*  Now  destroy  the  hardware  and  free  the  memory. 

*/ 

vfc  jdestroy  ( vfc  jdev) ; 
for(j=l;  j<numJoop;  j++) 
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fre€(color[j]); 


rettim; 


B.15  motion.c 

A 

*  File:  motion.c 

*  Created:  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose: 

* 

*  Assumes: 

* 

*  Modified: 

*  By: 

*  Why: 

V 

^include  <stdio.h> 

^include  <sys/types.h> 

^include  "vlc.lib.h" 

#include  "globals.h" 

extern  struct  image^jtrs  *zjnotion(); 
extern  int  zjstoreJmage(); 

mainO 

{ 

ujchar  *face,  *ptr; 

register  int  i; 

struct  region  ♦head-regions,  ♦face.ptr; 
struct  image-ptrs  ♦im.ptrs; 
char  filename[?0]; 


im4)trs  =  (struct  image-ptrs  ♦)z_motion(); 
if(im4>trs  ==  NULL) 

{ 

printf("Trouble  getting  images\n"); 
exit(l); 

} 

sprintf(filename,  "Xs  " ,  "motion .  gra" ); 

if(zjstore-image(im4)trs-  motion,  filename,  NTSC-SIZE)  <  0) 
fprintf(stderr, "Unable  to  write  Xs  to  file\n", filename); 

sprintf(filename,  "Xs  " , "  image  1 .  gra" ) ; 

if(z.storeJmage(im-ptrs— ‘imagel,  filename,  NTSC-SIZE)  <  0) 
fprintf(stderr,"Unable  to  write  Xs  to  file\n", filename); 
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sprintf(filename,'7.8","iinage2 ,  gra"); 
if(zjtoreJmage(im-ptrs->image2,  filename,  NTSC_SIZE)  <  0) 
fprintf(8tderr, "Unable  to  write  '/.s  to  file\n", filename); 

free(im4>trs— ►imagel); 
free(im4)trs— +image2) ; 
free(im^tr8-+motion) ; 
fi'ee(im4Jtrs); 
free(head_regions) ; 

} 
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B.16  rgb_motion.c 
/* 

*  File:  rgb-motion.c 

*  Created:  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  take  in  images  in  a  loop  numJoop 

*  times.  Each  pass  thru  the  loop  takes  in  YUV  image  data, 

*  converts  the  data  to  8-bit  color,  square  pixel  data,  and 

*  save  the  8-bit  data  in  a  file  labelled  with  person’s  name. 

*  Number  of  loops  (numJoop)  and  person’s  name  (person) 

*  are  entered  during  execution. 

* 

*  Assumes:  Signal  coming  in  S- Video  port,  NTSC  format 

* 

*  Modified: 

*  By: 

*  Why: 

*1 

^include  <stdio.h> 

#include  <sys/types.h> 

#include  "vfc.lib.h" 

#include  "globala.h" 

#define  DIFFERENCE.THRESHOLD  10 

extern  struct  hwjcontrols  ♦z-set.vfcJiwO; 
extern  u_char  *z_grab_rgb(); 

extern  int  z^toreJmageO; 

main() 

{ 

ujchar  *color[2],  +im-ptr,  *ptr2,  *ptr3,  *ptr4, 

*red[2],  *green[2],  ♦blue[2], 

+ptr2a,  *ptr3a,  *ptr4a,  *motion; 
register  int  j,  k; 
struct  hw_controls  *hw_ptrs; 
char  filename  [30]; 

hw4)trs  =  (struct  hwjcontrols  *)z^et_vfc-hw(); 

for(j=0;  j<2;  j++) 

{ 

color[j]={uxhM  *)z-grab_rgb(hw_ptrs); 
if(coIor[j|  ==  NULL) 

{ 
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printf("ptr  is  null\n"); 
exit(l); 

} 

} 

A 

*  Create  name  for  image  data  and  store  in  file. 

V 

for(j=0;  j<2;  j++) 

{ 

sprintf(filename,  "%s'l,d*/,a  " , "  image  "  j, " .  rgb" ); 
if(zj8toreJmage(color[j],  filename,  (NTSCJSIZE*4))  <  0) 

fprintf(std[eiT, "Unable  to  write  %s  to  file\n", filename); 

} 


for(j=0;  j<2;  j++) 

{ 

blue[j]  =  (ujchar  *)malloc(NTSC^IZE); 
if(blnep]  ==  NULL) 

{ 

perrorC'malloc"); 

exit(l); 

} 

greenp]  =  (uxhar  *)malloc(NTSC-SIZE); 
if(green[j]  ==  NULL) 

{ 

peiTor("malloc"); 

exit(l); 

} 

red[jj  =  (ujchar  *)malloc(NTSCJSIZE); 
if(redLj]  ==  NULL) 

{ 

perror  ( "mallo  c  " ) ; 
exit(l): 

} 

im4)tr  =  coIor[j]; 
ptr2  =  blue[j]; 
ptr3  =  green[j]; 
ptr4  =  redjj]; 

for(k=0;  k<NTSC-SIZE;  k++) 

{ 

im.ptr++; 
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*ptr2+4-  =  *iin-ptr+4-; 

*ptr3++  =  *im-ptr++; 

*ptr4++  =  *im^tr++; 

} 

8printf(filename, "  y.s*/.dy.s  " ,  "blue"  j," .  gra") ; 
if(zjstoreJmage(blue[j],  fileucime,  (NTSCJSIZE))  <  0) 

fprintf(stderr, "Unable  to  write  %s  to  file\n", filename); 
sprintf (filename, "  y.s%dy  s  " , "  gr  e  en"  j, " .  gra" ) ; 
if(zjstore_image(green[j],  filename,  (NTSCJSIZE))  <  0) 

fprintf(stderT, "Unable  to  write  Xs  to  file\n", filename); 
sprintf  (filename, "  y.sy.dy.s  red"  j, gra" ) ; 

if(zjstoreJmage(red[j],  filename,  NTSCJSIZE)  <  0) 

fprintf(stderr, "Unable  to  write  */,s  to  file\n", filename); 


} 

/♦  motion  =  (u_char  *)maIIoc(NTSCJSIZE); 
if(motion==NULL  ) 
exit(l); 

im^tr  =  motion; 

ptr2  =  blue[0]; 

ptr2a  =  blueflj; 

ptr3  =  green[0]; 

ptrSa,  =  gTeen[lj; 

ptr4  =  red[0]; 

ptr4a  =  red[lj; 

for0=0;  j<NTSCJ5IZE;  j++) 

{ 

if((abs(*ptr2a-h+  -  *ptr2-h+)> DIFFERENCE.THRESHOLD)  || 
(abs(*ptr3a+-h  -  *ptr3-‘-+)> DIFFERENCE.THRESHOLD)  || 
(abs(*ptr4a+-h  -  *ptr4++)>DIFFERENCE.THRESHOLD)) 
*im.ptr-f-f-  =  0; 
else 

*im.ptr-h+  =  255; 

} 

if(zjstoreJmage(motion,”rgbjnotion.gra”,  NTSC^IZE)  <  0) 
fprintf(stderr,” Unable  to  write  motion  to  Sle\n”); 

*/ 

I* 

*  Now  destroy  the  hardware  and  free  the  memory. 

*/ 

vfc  jdestroy  (hw_ptrs— ♦vfc  jdev) ; 
for(j=0;  j<2;  j++) 

{ 

free(color[i]); 

free(red[j]); 
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free(green[j]); 

free(blue[j]); 

} 

free(motion); 


retvum; 
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B.  17  seg-test.c 


A 

*  File:  seg.test.c 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose: 

* 

*  Assumes: 

* 

*  Modified: 

*  By: 

*  Why: 

V 

^include  <stdio.h> 

#include  <sys/typea.h> 

#include  "vfc.lib.h" 

#mclude  "global 3. h" 

extern  struct  image4)tr8  *zjnotion(); 
extern  struct  region  *Z-top^lopes(); 
extern  uxhar  *zj8egj'egions(); 

extern  uxhar  *z_reduce(); 

extern  int  Z-storeJmage(); 

void  seg_test(); 

main() 

{ 

seg_test(); 

exit(O); 

} 

void 

seg.testO 

{ 

char  newnanie[30]; 

ujchar  ♦face,  ♦ptr; 

struct  region  ♦temp^jtr,  ♦headJist; 

struct  image4)trs  ♦images; 

register  int  i; 

int  count; 

FILE  ♦datJile; 

for{i=l;  i<20;  i++) 
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{ 

images= (struct  image4)trs  *)z_motioii(); 
if(images==NULL) 

{ 

printf( "Trouble  getting  images\n"); 
exit(l); 

} 

headJist  =  (struct  region  *)  z_top-slopes(images— ^motion, 
VFC_NTSC_WIDTH,VFCJ^TSCJIEIGHT); 
if(hezwlJist  ==  NULL) 
printf("No  head  regions  found. \n"); 
else 
{ 

count=l; 

while(head-list  ^  NULL) 

{ 

face  =  (ujchar  ♦)zjsegjregions(images—>image2, headJist, 

VFC-NTSC-WIDTH,VFCJ^TSCJIEIGHT,SM_WIDTH,SMJIEIGHT); 
if(face  ==  NULL) 

printf("face  ptr  is  null\n");  ,  j 

} 

else 

{ 

A 

♦  Create  name  for  the  face  data  and  store  in  hie. 

*/ 

sprintf(newname, '"/.sXdXsXd'/s", "face",!,"-", count, "  .sm"); 
if(zjjtoreimage(face,  newname,  SM-SIZE)  <  0) 
fprintf(stderr, 

"Unable  to  write  7,a  to  f ile\n",newname); 

} 

temp4)tr  =  headJist; 
headJist  =  temp^jtr— »next; 
free(temp4)tr); 
count4-+; 

} 

} 

A 

♦  Create  name  for  the  motion  data  and  store  in  hie. 

*1 

sprintf(newn  ame,"7,d7,s%d7,3", MOTION-THRESHOLD, i, "  .gra"); 
if(z-storeJmage(image8— ^motion,  newname,  NTSCJSIZE)  <  0) 
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fprintf(8tderr, "Unable  to  write  %a  to  file\n",newname); 
free(image8— ^motion) ; 


A 

♦  Create  name  for  the  image  data  and  store  in  Sle. 

*1 

8printf (newname, "  y.s%dy.s  " , "  imaige  "  ,i, "  •  gra" ) ; 
if(z^tore-image(images— ♦image2,  newname,  NTSCJSIZE)  <  0) 
fprintf(8tderr, "Unable  to  write  %a  to  file\n", newname); 

fi:ee(image8— »imagel); 
free(image8— ♦image2) ; 

A 

V  Now  free  the  memory. 

*t 

free(images); 

free(face); 

} 


return; 

} 
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B.18  testJam.c 

A 

*  File:  testJbin.c 

*  Created:  8  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  tests  the  zJambert  routine. 

* 

*  Assumes: 

* 

*  Modified: 

*  By  : 

*  Why: 

*/ 

#include  <stdio.h> 

#include  <sya/typea.h> 

^include  "vfc^lib.h" 

^include  "globala.h" 

extern  Ujchar  *zJambert(); 
extern  ujchar  *z^ab_gra(); 
extern  uxhar  *zj:educe(); 
extern  int  zjjtoreimage(); 
extern  int  zjnedian(); 

ciain() 

{ 

char  filename[30],  newnarae(30]; 

u^ar  oimagel,  >)>image2,  ♦redjdata; 
regiater  int  i; 
int  check; 

FILE  ♦dat-file; 

printf("Enter  the  raw  8-bit  grey  file  (640x480)  to  lambertize:  ") 

geta(filenaine); 

printf("\n"); 

A 

*  Allocate  memory,  read  in  image,  and  reduce  it. 

V 

imagel  =  (u_char  ♦)malloc(NTSCjSIZE); 
if(imagel  ==  NULL) 

{ 

perror  ( "mallo  c  " ) ; 
exit(l); 


82 


} 


if  ((dat-file  =  fopen(filename,"r"))  ==  NULL) 

{ 

printf("I  can't  open  the  Xs  file\n", filename); 
exit(l); 

} 

if(check=fread(imagel,  1,  NTSC-SIZE,  dat^e)#NTSCJSIZE) 
printf( "Error  reading  in  file"); 
else 
{ 

for(i=4;  i<12;  i+=4) 

{ 

image2=(u-char  ♦)zJambert(imagel,VFC_NTSC_WIDTH, 
VFC-NTSC_HEIGHT,i,i); 
if(image2  ==  NULL) 

{ 

printf("laBibert  ptr  is  null\n"); 
exit(l); 

} 

/♦  sprintf(newnaine,  ”%s%d%s%d%s”,  ”var”, 

2*i+l,”x”  .gra”); 

if(zj3toreJmage(image2,  newname,  NTSC^IZE)  <  0) 
printf(” Unable  to  write  %s  to  {ile\n”, newname); 

*/ 

free(image2); 

} 

redjdata=(ujchar  *)zjreduce(imagel,VFC-NTSC_WIDTH, 

VFC_NTSC-HEIGHT,RED.WIDTH,REDJIEIGHT); 

if(redjdata==NULL) 

{ 

printf( "reduct ion  error\n"); 
exit(l); 

} 

/*if( zjstoreJmage( red-data,  ”im 1 28X96.  red  ”, RED  SIZE )<0) 
fprintffstderr,” Unable  to  write  red.daf'-  to  Rle\n”);*/ 
for(i=4;  i<12;  i+=4) 

{ 

image2=(u_char  ♦)zJainbert(redjdata,RED_WIDTH, 

REDJIEIGHT,i,i); 

if(image2  ==  NULL) 

{ 

printf( " lambert  ptr  is  null\n"); 
exit(l); 

} 

!*  sprintf(newname,”%s%d%s%d9(>s”,”red-var”. 
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2*i-hl”x’’,2*i-hl”.red”); 

if(z-StoTeJmage(image2,  newname,  (REDJSIZE))  <  0) 
printf(  Unable  to  write  %s  to  file\n”, newname); 

V 

free(image2); 

} 

/*6ree(Tedjdata);*/ 

} 

fclo8e(dat_file); 

free(imagel); 

return; 

} 
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B.19  time-grab.c 
/* 

*  File:  time^ab.c 

*  Created:  8  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  take  in  images  in  a  loop. 

*  Each  pass  thru  the  loop  takes  in  YUV  image  data, 

*  converts  the  data  to  8-bit  grey,  square  pixel  data,  and 

*  save  the  8-bit  data  in  a  hie  labelled  with  person’s  name. 

*  The  number  of  images  ( up  to  ten )  and  the  root  of  the 

*  filenames  are  entered  during  execution. 

* 

*  Assumes: 

* 

*  Modified: 

*  By: 

*  Why: 

*1 

#include  <3tdio.h> 

#include  <sys/types.h> 

#mclude  "vfc_lib.li" 

#include  "globals.h" 

extern  struct  hwxontrols  *zjset_vfc_hw(); 
extern  Ujchar  *z^ab^a(); 

extern  int  z_storeJmage(); 

main() 

{ 

Ujchar  *grey[10]; 

register  int  j; 
struct  hw .controls  *hw_ptrs; 
char  filename  [30]; 

hw4)trs  =  (struct  hwjcontrols  *)z.set_vfcJiw(); 

system  ("date"); 

for(j=0;  j<20;  j++) 

{ 

grey[3]=(uxhar  *)z.grab.gra(hw4)trs); 
if(grey[j]  ==  NULL) 

{ 

printfC'ptr  is  null\n"); 
exit(l); 
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} 

} 

system  ("date"); 

A 

*  Create  name  for  image  data  and  store  in  Ble. 

*1 

/*  for(j=0;  j<  20;  j-h+) 

{ 

sprintf( filename,  ”  %s%d%s”,  ’’image”  j,  ”.gra"); 
if(z^toreJmage(greylj],  filename,  NTSC  SIZE)  <  0) 
fprintf(stderT,”Unable  to  write  %a  to  fi]e\n”, filename), • 

} 

*/ 

I* 

*  Now  destroy  the  hardware  and  free  the  memoiy. 

V 

vfcJestroy(hw4)trs— »vfcjdev); 
free(hw4)trs); 
for(j=0;  j<20;  j++) 
free(greyljl); 

retium; 

} 
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B.20  z-binarize-gra.c 

I* 

*  File:  z-binarize^a.c 

*  Created:  9  July  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  allocate  memory  and  store 

*  binary  image  data  (255  or  0  depending  on  the  THRESHOLD). 

*  The  code  return  a  pointer  to  the  binary  data;  the 

*  pointer  will  point  to  NULL  if  an  error  has  occurred. 

* 

*  Assumes:  The  image  is  the  same  size  as 

*  the  8  bit  gray  image  data  (both  1  byte/pixel). 

* 

*  Modified: 

*  By: 

*  Why: 

*1 

#include  <stdio.h> 

#include  <ay8/«ypes.h> 

#include  "vfc_lib.h" 

#define  THRESHOLD  75 

u.char  +z.blnarize-gra(gray_ptr,  size) 
u_char  tgray-ptr; 
int  size; 

{ 

uxhar  ♦bin.ptr,  ★terap.ptr,  *temp2-ptr,  *null4)tr  =  0; 
register  int  i; 

bin^tr  =  (ujchcir  ♦)malloc(size);  /*  8-bit  gray  and  binary  1  bpp  */ 
if(bin4)tr  ==  NULL) 

{ 

perror(  "malloc  " ) ; 
return  null4)tr: 

} 

temp4)tr  =  gray.ptr; 
temp24)tr  =  bin_ptr; 
for(i=0;  i<(size);  i++) 

{ 

if  (*temp4)tr+4-  <  THRESHOLD) 

*temp2-ptr++  =  0; 
else 

•temp2_ptr++  =  255; 
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} 


printf ( "  z_bin2u:  ize_gra  complet  e  \n\n  “ ) ; 
return  bin4>tr; 

} 
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B.21  z-grab_rgb.c 

f* 

*  File:  z_grab_rgb.c 

*  Created:  3  June  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  The  code  is  intended  to  allocate  memory  for  image  data, 

*  grab  YUV  image  data,  and  then 

*  convert  that  data  to  rgb  color,  square  pixel  data. 

*  The  code  return  a  pointer  to  the  rgb  color  data 

*  which  will  point  to  NULL  if  an  error  has  occurred. 

* 

*  Assumes:  H/W  has  been  initialized  and  signal  is  NTSC  format. 

*  YUV  data  is  2  bytes  per  pixel  (bpp),  rgb  color  1  bpp. 

* 

*  Modihed: 

*  By: 

*  Why: 

*1 

#include  <stdio,h> 

#include  <3ys/types.h> 

^include  "vfc.lib.h" 

#define  YUV^IZE  VFC  J^TSC  JIEIGHT*VFC-YUV-WIDTH*2  /*  720  x  480  x  2bpp  */ 

extern  int  ntsc^ize; 
extern  int  colormapjoffset; 

ujchar  *z_grab-rgb(hw^tr) 

VfcDev  *hw_ptr; 


{ 

Ujchar  *yuv_data,  *im-data,  *null^tr  =  NULL; 
register  int  i; 

I* 

♦  Allocate  space  for  images. 

V 

yuvjdata  =  (ujchar  *)malloc(YUVJSIZE); 
if(yuvjdata  ==  NULL)  { 
perror("malloc"); 
return  null_ptr; 

} 

im^ata  =  (ujchar  *)maJloc(ntscjsize*4); 
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if(imjdata  ==  NULL)  { 
perror("malloc''); 
free(yuvjdata); 
return  null^tr; 

} 


f*  printf(”YUV  and  color  image  memories  created\n”); 

*1 

A 

*  Do  a  grab  and  read  in  an  image 

V 

if(vfc^ab(hw^tr)  <  0)  { 

A 

♦  If  the  grab  fails  the  image  read  will 

*  be  garbage. 

*/ 

fprintf(stderr, "Warning,  grab  failed\n"); 

fre€(yuvjdata); 

free(imjdata); 

return  null4)tr; 

} 

if(vfc^vreadjq(hw4)tr,  yuv-data,  VFC_NTSC)  <  0)  { 
fprintf(stderT, "Warning,  vfc_yuvread  failed\n"); 
free(yuvjdata); 

&ee(imjdata); 
return  null4>tr; 

} 

A  printf(”yUV  image  grabbed  and  read  from  the  hardware\n”); 


A 

♦  Convert  the  YUV  data  into  8-bit  gray  square  pixel  data. 

V 

if  (colonnapjofFset  ^  0) 

vfcJnitJut(colonnapj)fFset); 
vfc_yuv2rgbjitsc(yuv_data,  inudata); 

/♦  printf(”YUV  data  converted  to  rgb  color  data\n”); 

*/ 

free(yuvjdata); 

printf( " z_gr ab^rgb  c  ompl et  e\n\n" ) ; 
return  inudata; 

} 
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B.22  zJambert.c 


*  File:  zJambert.c 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose:  This  code  Gnds  local  variations  in  a  image  (thereby 

*  discounting  object  brightness)  with  a  variable  size  ” local 

*  box”.  The  code  moves  a  box  around  each  pixel  in  an  image, 

*  Snds  the  mean  of  the  pixel  values  in  the  box  (for  each 

*  pixel)  and  then  subtracts  the  mean  from  that  pixel  which  is 

*  surrounded  by  the  box.  The  box  is  specihed  all  the  pixels 

*  some  number  away  from  the  pixel  of  interest  in  the  positive 

*  and  negative  direction  along  both  the  horizontal  and  vertical 

*  axes;  the  number  of  pixels  in  the  horizontal  direction  is 

*  passed  in  as  hor;  the  vertical,  vert. 

*  A  threshold  is  then  set  for  the  variations  to  identify  regions 

*  both  negative  and  bright  with  regard  to  their  surroundings. 

*  The  code  to  create  a  grey  scale  image  by  adding  128  to  the 

*  variations  found  is  left  in,  but  commented  out. 

* 

*  Assumes: 

* 

*  ModiSed: 

*  By: 

*  Why: 

:ti4!**1c*:t’******************************************************************/ 

#include  <stdio.h> 

#include  <sys/types.h> 

^include  "vfc_lib,h" 

#include  "globals.h" 

extern  ujchar  *zj8toreJmage(); 

u_char  *z Jambert(imjdata,  width,  height,  hor,  vert) 

ujchar  >t>im.data; 

int  width,  height,  hor,  vert; 

{ 

ujchar  *lam-data,  *im_ptr,  *lam.ptr,  ♦null4)tr=0, 

♦variation-data,  *variation_ptr, 

♦mean-data,  ♦mean.ptr; 
register  int  i,  j,  k,  m; 
int  sum,  mean,  check, 

col  jum[VFC_NTSC  JIEIGHT]  [VFC-NTSC-WIDTH] , 
neigh[VFC-NTSCJIEIGHT][VFC-NTSC-WIDTH]; 
char  newname(30]; 
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/*  lam-data  =  (ujcbar  *)malloc(widtb*bejght); 
if(lamjdata  ==  NULL) 

{ 

perror(”maIloc”); 
return  mill.ptr; 

} 

*! 

vanation_data  =  (u-char  *)malloc(width*height); 
if(variationjdata  ==  NULL) 

perror("malloc"); 

return  nTill4)tr; 

} 

im_ptr=im-data; 

for(i=0;  i<width;  i++) 

{ 

coljum[0][i]=0; 

for(j=0;  j<(2*vert+l);  j++) 

coljsum[0][i]+=*(im4)tr+(width*j)+i); 


for(i=0;  i<width— (2*hor+l);  i++) 
neigh[0][i]=0; 

forO=0;  j<(2*hor+l);  j++) 
neigh  [0]  (ij + =col  jsum[0]  [j] ; 


for(i=l;  i<(height-(2*vert+l));  i++) 
for(j=0;  j<width;  j++) 

col_suin[i][j]=coUuin[i-l][j]  -  *(im-ptr+(width*(i-l))+j)  + 
♦(im.ptr+(width*(i+(2*vert)))+j); 
neigh[i][0]=0; 

for(j=0;  j<(2*hor+l);  j+-»-) 
neighfi]  [0]+=col  jium[i]  [j] ; 
for(j=l;  j<(width-(2*hor+l));  j+-f ) 

neigh(i][j]=neigh[ij[j-l]+col^um[i][j+2*hor]-col^um[i][j-l)- 


im4>tr=imjdata; 
variation^tr=variationjdata; 
/*  Iam-ptr=lamjdata; 
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*/ 

for(i=l;  i<height;  i++) 
for(j=l;  j<width;  j++) 

{ 

if((i>vert)&&(i<(heiglit— vert))&:&(j>hor)&&(j<(width— hor))) 

{ 

mean=neigh[i— vert]  [j— hor]/((  H-2*vert)  ♦(  H-2*hor) ) ; 
check  =  128  4-  ((*im-ptr++)  —  mean); 
if(check>255) 
check  =  255; 
else  if(check<0) 
check  =  0; 

/*  *Iain.ptr+-h  =  check; 

*1 

if(check<120) 

*variation.ptr++  =  0; 
else  if(check>136) 

*variation_ptr+4-  =  128; 
else 

♦variation_ptr++  =  255; 

} 

else 

{ 

/*  *lam.ptr++  =  255; 

*/ 

♦variatioii^tr+4'  =  255; 
im^tr+4-; 

} 

} 

/*  if((width==VFCJfTSC.WIDTH)&:&:(height==VFCJfTSCJIEIGHT) ) 

{ 

sprintf(newname,  "%s%d%s%d%s” ,  ”Iam”,2*hor-hl ,  ”x”, 

2*vert+l ,  ”.gra”); 

if(zjstoreJmage(Iam-data,nevmame,  (NTSC^IZE))  <  0) 
fprintf(stdeiT,” Unable  to  write  var.data  to  Gle\n”); 

} 

else  if((widtb==RED.WIDTn)&&(height==REDJIEIGHT) ) 

{ 

sprintf(newname,  ”%s%d%s%d%s”  ,”redJam”,2*hor-hl,”x” , 
2*vert+l”.red”); 

if(zj3toreJmage(Iam.data,newname,  (RED SIZE))  <  0) 
fprintf(stdeiT,” Unable  to  write  var.data  to  file\n”); 

} 

free(Iamjdata); 

*1 

printf("2_laobert  complete\n\n"); 
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return  variationjdata; 


B.23  z_side-edges.c 

*  File:  z^idejedgea.c 

*  Created:  August  1992 

*  By:  Kevin  Gay 

* 

*  Purpose: 

* 

*  Assumes: 

* 

*  ModiSed: 

*  By: 

^include  <stdio.h> 

^include  <sy8/types.h> 

#include  "vfc.lib.h" 

^include  "globals.h" 

#deHne  SLOPE.THRESHOLD  8 

extern  int  zjstoreimage(); 

struct  region  *z  jidejedge8(image,  width,  height) 
uxhar  oimage; 
int  width,  height; 

{ 

register  int  i,  j,  run,  threshold; 

int  slope[VFCJ^TSC-HEIGHTI(VFC-NTSC-WIDTH-l], 
count=0,  found,  leave  Joop,  repeat, 
notjsmaller,  rjcol,  Lcol,  temp,totaJ=0; 
uxhar  *im-ptr,  ♦im_ptr2,  *edge Jmage; 
char  newname[30]; 

struct  region  *poss_faces=0,  ♦ptr,  ♦ptr2; 

if  (width>VFC_NTSC.WIDTH) 

{ 

printf("Too  wide  for  routine\n'*); 
return  possJaces; 

} 

if  (height>VFC-NTSC_HEIGHT) 

{ 

printf("Too  high  for  routine\n"); 
return  possJfaces; 

} 
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/««4i4>*4>«*****4<«>)i***4i*  Find  slopes  from  side  *****♦♦♦*♦***♦**♦*♦/ 


edgeJmage  =  (ujchar  *)malloc(width*height); 
if(edgeJmage  ==  NULL) 

{ 

perror(  "malloc  " ) ; 
return  possJaces; 

} 

for(run=l;  run<l;  run++) 

{ 

ini4)tr2=edgeJmage; 
for(i=0;  i<height;  i++) 

{ 

im_ptr=iinage; 
im_ptr+=s  (width+i) ; 
for(j=0;  j<:(width— run);  j++) 

{ 

if(a+i)%4o==o) 

^rintf(dat-dle, "  \n" ) ; 

sIope(i]  [j] = ♦  (im4)tr+run) — *im4)tr++ ; 

if(abs(slope(i][j])>SLOPE-THRESHOLD) 

♦im_ptr2++  =  0; 
else 

*im_ptr24-+  =  255; 

} 

for(j=(width-run);  j<width;  j++) 

♦im_ptr24-+  =  255; 

} 

sprintf(newname,  "%s%dXs  " , "  edge  "  ,run, " .  gra" ); 
if(zj9toreJmage(edge  Jmage,  newname,  width*height)  <  0) 
fprintf(stderr, "Unable  to  write  V,s  to  filo\n", newname); 

} 


free(edgeJmage); 

printf("z_8ido.edges  complete\n\n"); 
return  pos8_faces; 

} 
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Appendix  C.  Segmentation  Test  Image  Set 


C.  1  General 

The  following  figures  is  the  complete  set  of  images  which  resulted  from  the 
segmentation  testing.  Each  row  of  these  figures  begin  with  a  grey  scale  image,  the 
associated  motion  image,  and  all  the  potential  face  images  segmented  from  the  input 
image  followed  by  a  reduced  image  of  just  the  segmented  region.  Thus,  there  are 
instances  when  a  segmented  region  image  with  its  reduced  image  are  shown  and  there 
is  no  grey  scale  or  motion  image  preceding  these  images  in  that  row.  These  instances 
represent  the  case  of  multiple  segmented  regions  from  a  single  motion  image.  As  such, 
the  grey  scale  and  motion  images  in  the  row  immediately  preceding  the  row  where 
there  is  no  grey  scale  or  motion  images  are  the  associated  grey  scale  and  motion 
images  for  those  segmented  regions  as  well.  A  complete  set  of  only  the  segmented 
regions  for  each  test  are  provided  for  comparison  at  the  end  of  this  appendix. 
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Figure  18.  Segmentation  Test  I  Images  (1  of  4)  [The  Lowest  Motion  Threshold 
Tested] 
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Figure  22.  Segmentation  Test  II  Images  (1  of  4)  [The  Middle  Motion  Threshold 
Tested] 
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Figure  23.  Segmentation  Test  II  Images  (2  of  4) 
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Figure  24.  Segmentation  Test  II  Images  (3  of  4) 
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Figure  25.  Segmentation  Test  II  Images  (4  of  4) 


105 


Figure  26.  Segmentation  Test  III  Images  (1  of  4)  [The  Highest  Motion  Threshold 
Tested] 
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Figure  27.  Segmentation  Test  III  Images  (2  of  4) 
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Figure  28.  Segmentation  Test  III  Images  (3  of  4) 


Figure  30.  Segmented  Regions  I  [The  Lowest  Motion  Threshold  Tested] 
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Figure  31.  Segmented  Regions  II  [The  Middle  Motion  Threshold  Tested] 
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Figure  32.  Segmented  Regions  III  [The  Highest  Motion  Threshold  Tested] 
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